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(54) Humanized monoclonal antibodies. 



(57) A method of preparing humanized monoclonal antibodies is described which utilizes comparative 
model burding methodology. A humanized anti-CD18 antibody, 60.3, has been formulated and 
demonstrated to have analogous binding characteristics to the original murine monoclonal antibody, 
while displaying essentialy complete human Ig heavy and light chains. 
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Technical Field 

The present invention is directed to a method for producing humanized monoclonal antibodies by utilizing 
comparative model building to construct the humanized antibody from homologous regions of human proteins 
5 by rational design. Specific humanized monoclonal antibodies are prepared. 

Background of the Invention 

Murine derived monoclonal antibodies have been utilized as diagnostic and therapeutic agents for numer- 

10 ous human pathologic conditions including acute inflammatory responses associated with numerous diseases. 
Administration of murine derived monoclonal antibodies (mAbs) as therapeutic agents in man has been se- 
verely limited by the development of antibody within the recipient to the mouse antigens of the murine derived 
monoclonal antibody. In attempts to circumvent this outcome mAbs have been restructured by recombinant 
DNA technology in such a way as to decrease their immunogenicity in humans. Immunoglobulins are well de- 

15 fined both chemically and biologically with the general structures illustrated in Molecular Cell Biology. Darnell, 
Lodish and Baltimore, Eds.Scientific American Book, Inc. W.H. Freeman, New York, NY (1986). Initially, this 
involved the construction of chimeric antibodies, Morrison et al. Proc. Natl. Acad. Sci. USA 81: 6851-6855 
(1 984). Recombinant technology was employed to replace the murine heavy and light chain constant regions 
with corresponding human constant regions. Upon expression, such interspecies antibody chimeras yielded 

20 molecules with the antigen binding specificities of the parent murine antibody. The following references gen- 
erally describe chimeric antibody technology: Lobuglio etal. Proc. Natl. Acad. Sci. USA86: 4220-4224 (1 989) : 
United States Patent 4,81 6,567 ; PCT International Publication No. WO 87/02671, published May 7, 1987 ; Eu- 
ropean Patent Publication No. 255,694, published February 10, 1988 ; European Patent Publication No. 
274,394, published July 13, 1988 ; European Patent Publication No. 323,806, published July 12, 1989 ; PCT 

25 International Publication No. WO/89/00999, published February 9, 1989; European Patent Publication No. 
328,404, published August 16, 1 989; European Patent Publication No. 322, 424 published September 13, 1989, 
and European Patent Publication No. 438,310 published July 24, 1991. 

The immunogenicity of chimeric antibodies can be further reduced by grafting rodent hypervariable re- 
gions into the variable region frameworks of human light and heavy chains, Jones et al. Nature 321 : 522-525 

30 (1 986). These hypervariable regions have also been termed complementarity determining regions (CDR). The 
technique involves the substitution or recombinant grafting of antigen-specific murine CDR sequences for 
those existent within "generic" human heavy and light chain variable regions, European Patent Publication No. 
239,400, published September 30, 1987. In this approach, little, if any, concern is shown for the variable region 
frameworks (FRs) within which the murine CDR's are placed. 

35 Studies by Queen etal , Proc. Natl. Acad. Sci. USA 86 : 10029-10022 (1989), have shown the CDRs from 

a murine anti-Tac monoclonal antibody can be grafted into a human framework. The human framework variable 
regions were chosen to maximize identity with the murine sequence. The authors also utilized a computer mod- 
el of the mMab to identify several amino acids which, while outside the CDRs, are close enough to interact 
with the CDRs or antigen . These residues were mutated to the residue found in the murine sequence. The 

40 grafted anti-Tac antibody had an affinity for the antigen which was only about 1/3 that of the murine anti-Tac 
mAb. 

Leukocyte infiltration into an inflammatory site is dependent on the adhesion of the leukocytes to the en- 
dothelium priorto extravasation. The rapid binding of polymorphon u clear leukocytes (PMN) to the endothelium 
and diapedesis occurs within minutes after the introduction of a chemotactic stimulus in tissue, Cybulski et al ., 

45 Am. J. Pathol. 124 : 367 (1986). This rapid extravasation appears to depend on the response of the PMNs to 
chemoattractants and on the presence of the CD11/CD18 family of glycoproteins on the leukocyte surface. 
The family of glycoproteins associated with PMNs are termed leukocyte integrins and include LFA-1 
(CD11a/CD18), Mac-1 (CD11b/CD18) and p150,95 (CD11c/CD18). Each of these heterodimers has a unique 
alpha chain (CD11 a, b, c) and an invariant beta-2 chain (CD1 8). Stimulation of PMNs with various chemotactic 

so factors causes increased expression of leukocyte integrins (CD11 b/CD18) fostering strong adhesion to unsti- 
mulated endothelium i n vitro ; Harian, Blood 65 :513 (1985). Essentially all of the chemoattractant- induced ad- 
hesion is inhibited by treating the PMNs with mMAbs specifically reactive with the CD11/CD1 8 complex, Harian 
et al. , Blood 66 : 167 (1985); Zimmerman and Mclntyre J. Clin. Invest 81 : 531 (1988); Smith etal. , J. Clin . 
Invest 82 : 1746 (1988) and Lo et al. , J. Exp. Med. 169 : 1779 (1989). 

55 Murine hybridomas producing monoclonal antibodies reactive with the beta chain common to the Mac-1, 

LFA-1 and the p150,95 integrins have been described. The mMAbs are designated 1B4, 60.3, TS1/18, H52 
and ATCC TIB 218. The 1 B4 is an lgG1 antibody and was prepared by Wright et al. Proc. Natl. Acad. Sci. USA 
80 : 5699-5703 (1983), the 60.3 is also lgG2a and was prepared by Beatty et al. J. Immunol. 131 :291 3-291 8 
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(1 983), TS1/18 is an lgG1 antibody and was prepared by Sanchez- Mad rid etal. , J. Exp. Med. 158 : 1785-1803 
(1983), and ATCC TIB 218, a lgG2a kappa prepared by Springer et al , J. Exp. Med. 158 : 586-602 (1983). 
These antibodies appear to be functionally equivalent and cross-react with the beta 2 - chain found on human, 
sheep, pig, rabbit and dog leukocytes but not with the beta-2 chain found on murine and rat leukocytes. 

5 

Summary of the Invention 

The present invention is directed to a method for producing humanized monoclonal antibodies by utilizing 
a process of comparative model building. In this method computer data bases are searched to locate homo- 
10 logous human protein sequences that correspond to specified regions of the non-human derived (usually mur- 
ine) antibody, and a series of models is formulated, tested and modified to produce a model of a humanized 
antibody which is then constructed by recombinant DN A technology. In a preferred embodiment, a humanized 
monoclonal antibody corresponding to the murine anti-CD18 antibody 60.3 was prepared. 

The variable (V) region sequences from both the heavy (H) and light (L) chains were determined from cDNA 
15 (amplified by PCR), and spliced onto human constant (C) regions, resulting in a chimeric 60.3 Ab (lgG1, kappa). 
The chimeric Ab was expressed in tissue culture (Ag8.653 mouse myeloma cells, detected by ELISA), and ex- 
amined in binding assays. The results from competition and inhibition assays showed that the chimeric Ab was 
as effective as the murine 60.3 mAb. 

The deduced murine V H and V L protein sequences were compared to the protein sequence data base, and 
20 two human Ig protein sequences were selected to be used as templates. The present inventors modeled a mur- 
ine 60.3 Fv according to the deduced V H and V L protein sequences. Based on the 60.3 Fv model and the two 
human template sequences selected from the protein data base, a humanized Fv was modeled. 

Construction of the humanized 60.3 was done by piecing 5 pairs of complementary oligonucleotides to- 
gether (spanning the entire V region) to form the VH and VL. These were then attached onto vectors containing 
25 genes for appropriate C regions to form humanized Ab (lgG1 , kappa). The humanized proteins were again ex- 
pressed in Ag8.653 cells and binding assays were done. FACS analyses indicated that the humanized Ab rec- 
ognized cells expressing CD18. About a dozen of the humanized 60.3 Ab master wells were transferred and 
assayed for Ig. 

30 Brief Description of the Drawings 

In the drawings: 

Figure 1 illustrates an amino acid comparison of the murine 60.3 antibody heavy chain (m60.3) with the 
human variable heavy chain consensus sequence for the framework regions of human subgroup V H 1 
35 (hVh1/Jh4), the human template (M030) used for humanization (h60.3 template), a germline sequence hom- 
ologous to M030 (21-2 'CL), and phases I to IV of the humanization process. All amino acids which are identical 
to the phase IV sequence are shaded. In addition, sequences which are different than the previous phase se- 
quence are shown in bold. 

Figure 2 illustrates an amino acid residue comparison of the murine 60.3 antibody light chain (m60.3) with 
40 the human variable light chain consensus sequence for the framework regions of human subgroup 
V k lll(hVklll/Jk), phases l-IV of the humanization process and the human template used for the humanization 
(h60.3 template). All amino acids which are identical to the phase IV sequence are shaded. In addition, se- 
quences which are different than the previous phase sequence are shown in bold. 

FIGURE 3 illustrates five pairs of complementary oligonucleotides corresponding to the variable regions 
45 of the light chain. 

FIGURE 4 illustrates five pairs of complementary oligonucleotides corresponding to the variable region 
of the heavy chain. 

FIGURE 5 illustrates the binding of murine, chimeric and humanized 60.3 antibody to HL60 human mye- 
lomonocytic cells. Fluorescein isothiocyanate (FITC) labelled antibody was incubated with cells at the concen- 
50 trations indictated on the abscissa and the amount of antibody bound is indicated by relative fluorescence in- 
tensity on the ordinate. 

FIGURE 6 illustrates the competition by preincubation of cells with chimeric and humanized 60.3 of the 
binding of FITC-labelled murine 60.3 to HL60 cells. HL60 cells were preincu bated with 1 ug/ml of either chimeric 
60.3 (circles) or humanized 60.3 (squares), followed by incubation with various concentrations of FITC-labelled 
55 murine 60.3. In the absence of competing antibody, FITC - m60.3 binding to the HL60 cells increased with 
increasing concentration (x). 

FIGURE 7 illustrates the direct competition of FITC-murine 60.3 binding to HL60 cells by chimeric and hu- 
manized 60.3. The dashed line shows the fluorescent intensity of binding by FITC-murine 60.3 in the absence 
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of competitor, while additions of increasing concentrations of chimeric 60.3 (squares) and humanized 60.3 (cir- 
cles) inhibited FITC-m60.3 binding. 

FIGURE 8 illustrates the results of a chemiluminescence binding assay of murine (closed square), chimeric 
(open square) and humanized (closed diamond) 60.3 antibody upon HL60 cells. The anticancer antibody L6 
5 (open diamond), which does not bind to HL60 cells, was used as a control. 

FIGURE 9 illustrates a series of restriction maps for plasmids utilized in the production of the variable light 
chain plasmid pGK.11. 

FIGURE 10 illustrates the nucleotide sequence for the humanized variable light chain. 

FIGURE 11 illustrates a series of restriction maps for plasmids utilized in the production of the variable 
10 heavy chain plasmid pNy1.16. 

FIGURE 12 illustrates the nucleotide sequence for the humanized variable heavy chain. 

Description of Preferred Embodiments 

15 The present invention is directed to a method of producing humanized monoclonal antibodies (mAbs) by 

utilizing a process of comparative model building and rational design. In a preferred embodiment this method 
is utilized to produce a humanized molecule of the anti-CD18 murine monoclonal antibody 60.3. The mouse 
mAb 60.3 (lgG2a), which recognizes a functional epitope on the beta subunit (CD18) of leukocyte integrins, 
prevents the adherence and aggregation of polymorphonuclear neutrophils (PMN), resulting in a blockage of 

20 neutrophil mediated damage during shock and reperfusion injury. It would be advantageous to modify mAb 
60.3 to reduce the potential for HAMA (human anti-mouse Abs) response. Therefore, in one embodiment, the 
present invention was directed to "humanize" the 60.3 mAb by creating an Ab whose constant (C) region is 
human, but whose variable (V) region is composed of both human (principally framework sequences) and 
mouse (principally CDR loops) sequences. For the studies described in the present invention, murine, chimeric 

25 and humanized antibodies were purified from solution by protein A chromatography on I PA - 400 Fast Flow 
Immobilized rProteinA(Repligen, Cambridge, MA) using the manufacturer's recommended protocol. 

In the present invention recombinant methods are utilized to produce humanized monoclonal antibodies 
that contain complementarity determining regions (CDRs) analogous to the originally derived monoclonal an- 
tibody, and which have homologous human heavy and light chain framework regions. The resulting antibodies 

30 demonstrate the binding affinity and specificity of the original antibody yet are completely humanized mono- 
clonal antibodies. 

As used herein the term "humanized" and its various grammatical forms as it relates to antibodies is de- 
fined to mean that the amino acid residues of the antibody in the heavy and light chains are replaced with 
amino acid residues corresponding to homologous human protein regions without altering the binding activity 

35 of the antibody. For the humanized 60.3 monoclonal antibody of the present invention there is approximately 
80% sequence identity of the variable regions of the heavy and light chains with those of the human mAb, 
while the constant regions are distinctly human. Some variation of individual amino acids in the antigen binding 
and framework regions are contemplated by this invention and are within the scope of this invention when such 
variations do not interfere or inhibit the binding to antigen, such as the He for Glu substitution at position 106 

40 of the light chain. 

As used herein the term "canonical loop conformation" refers to a small repertoire of main chain confor- 
mations for five of the six loops (all except H3). The particular conformation adopted is determined by only a 
limited number of residues within the loop or the framework. 

As used herein the term "framework residues" means residues which are located outside the structurally 
45 defined CDR loops. These residues can be part of the hypervariable regions for the antibody. 

As used herein the term "monoclonal antibody" refers to all recombinant antibodies derived from an initial 
single cell and includes murine monoclonal antibodies, chimeric antibodies and humanized antibodies. 

In the present invention a procedure of comparative model building was utilized to construct the appro- 
priately designed humanized antibody. As a preferred embodiment, the modeling of the murine 60.3 antibody 
50 is summarized. 

The Brookhaven Protein Database (Bernstein et al. (1977) J. Mol. Biol. 122 : 535-542) was searched for 
the antibody crystal structures which show reasonably high homology (> 50% sequence identity) to the variable 
regions of murine 60.3. If the variable light chain and heavy chain templates which fulfill these criteria are from 
different antibodies, these structures are combined by superposition of the set of structural invariants at the 
55 V L - V H domain interface (Novotny et al. (1985) Proc. Natl. Acad. Sci. USA 82 : 4592-4596). This provides the 
"structural template" for model building of murine 60.3 (and humanized 60.3 below). 

The CDR loops and their known structural framework determinants of murine 60.3 are determined by de- 
fining the CDR loops structurally according to the method of Chothia et al. (Chothia etal. (1989) Nature 342 : 
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877- 883). The structurally defined CDR loops consist on average of shorter sequence segments than the hy- 
pervariable regions defined by Kabat (Kabat et al (1987) Sequences of Proteins of Immunological Interest). 
The five canonical CDR loops (L1-13, H1-H2, i.e. all except H3) in the 60.3 variable light and heavy chain are 
assigned to known canonical loop conformations, and the framework residues which are crucial for the con- 

5 formations of the CDR loops are determined. 

The non-canonical H3 loop region within the 60.3 sequence is defined and a model of murine 60.3 is then 
built. The CDR loops of the structural template are replaced with canonical CDR backbone templates as de- 
termined using interactive computer graphics (INSIGHT II, Ver. 2.0 Biosym Technologies, Inc. 1991). Loop 
searches (Jones TA(1986) Embo J. : 819-822) in the Brookhaven Protein Data Bank are carried out to extract 

10 an initial backbone approximation for the non-canonical CDR loop H3. 

All non-conserved amino acid side chains in similar positions are replaced using interactive computer 
graphics. The model then consists of a combination of backbone fragments of different antibodies with replaced 
side chains. The model is solvated with a 7 A water layer and the structure is refined using an energy minimi- 
zation (Mackay et al. (1989) Prediction of Protein Structure and the Principles of Protein Conformation. New 

15 York: Plenum Press pp.31 7-358) protocol where, over the course of 1600 cycles of conjugate gradients mini- 
mization, constraints of 80 kcal/mol/A 2 on all protein non-hydrogen atoms are gradually released until, at the 
final stage of the minimization procedure, all atoms of the system are free to move. 

The most homologous human variable region sequences are found by searching the sequence data base 
for the most homologous human sequences for the variable light and the variable heavy chains of the 60.3 

20 antibody and these sequences are combined to obtain a "human template". The structural template for murine 
60.3 is confirmed to be suitable for the human template. The sequences of the structural template chosen ini- 
tially showed > 50% sequence identity to the variable regions of the "human template". Furthermore the per- 
cent homology is chosen to be similar to that found for comparison of the structural template with the murine 
sequence. The CDR loops and known structural determinants are then grafted onto the human template 

25 (Jones et al. (1986) Nature 321 : 522- 535). The CDR loop regions and structural determinants in the "human 
template" sequence are replaced by the analogous sequences from the murine antibody, as determined above. 
This provides the Phase I h60.3 sequence. A Phase I model of humanized 60.3 is built using the same model 
building protocol as described for murine 60.3. 

In Phase II the murine and Phase I h60.3 models were compared. These models now consist of the murine 

30 binding site and murine framework (murine 60.3 or m60.3) and of murine binding site and human framework 
(Phase I humanized 60.3 or h60.3). The of murine and Phase I h60.3 were superimposed using the structural 
invariants of the immunoglobulin fold (Novotny et al. (1985) Natl. Acad. Sci. USA 82 : 4592-4596). The models 
of the binding site regions were compared residue by residue from the N-terminus to the C-terminus. By this 
comparison, all framework residues and residues within the framework - CDR junctions which can interact with 

35 the murine CDR loops and may therefore be important for the structural integrity of the murine binding site 
were identified. These residues typically include all the known structural determinants for the specified can- 
onical CDR loop conformations (Chothia et al. (1989) Nature 342 : 877-883) and other residues found to be 
critical in the comparison (due to proximity to the CDR loops and potential for interaction with them). These 
residues were then "re-mutated" to the murine residues, forming the Phase II h60.3 model. 

40 The murine 60.3 model and the modified humanized sequence were then further refined by again sub- 

jecting the models to the energy minimization procedure described above. This construct represents the Phase 
II h60.3 model. 

In Phase III, further improvements of the structural model of h60.3 were made A conformational search 
(Bruccoleri RE and Karplus M. (1987) Biopolymers 26 : 137- 168) was carried out over regions of the binding 

45 site which cannot be directly assigned to known structural templates. Typically, this is the CDR loop H3 and 
perhaps one or more CDR loops which may not belong to known canonical structure types. Side chain con- 
formations of the antigen binding site loops and the framework - CDR junctions are also further refined using 
an iterative conformational search protocol (Bruccoleri RE and Karplus M. (1987) Biopolymers 26 : 137-168). 
The refined model structure may be called Phase III h60.3. 

50 in Phase IV, analysis of the binding site features of the Phase III h60.3 model was carried out. The binding 

site features of the construct were analyzed in detail in order to classify the antibody structure, for example, 
as a "groove-type" or "cavity-type" or "flat" antibody. This allows one, in the absence of detailed structural 
knowledge of the antibody-antigen complex, to postulate which parts of the CDR surface or residues at the 
CD R-framework junctions are unlikely to be involved in antigen binding. In the Phase III and earlier models, 

55 these positions may be occupied by murine residues which can now be changed to human residues. 

This improvesthe "degree of humanization" of the antibody since parts of some CDRIoopsand other entire 
CDR loops can be "humanized". At this stage, the final version of humanized 60.3, Phase IV h60.3 was ob- 
tained. 
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Comparative molecular modeling has been used here to enable a detailed three-dimensional comparison 
of a murine antibody and its humanized version. This comparative study has enabled the present inventors to 
analyze residue-residue interactions which are likely to be critical to retain the murine specificity in the struc- 
tural context of a largely human antibody. Furthermore, the different modeling concepts based on structural 

5 homology (experimental structural data) and conformational search (which represents an abinitio method) 
have been combined to obtain the best possible picture of the 60.3 binding site in order to, gain some insight 
into which of the binding site residues may be not involved in antigen binding. 

In addition to the application summarized above, comparative model building can be applied to other prob- 
lems. For example, many of the antibody structures which are modeled today are used to guide mutagenesis 

10 experiments in order to explore affinity and antigen specificity. Such antibodies are often modeled because 
experimental structures are not available for these antibodies. Comparative model building provides an op- 
portunity to assess the confidence level of such theoretically derived structures. 

For example, the combining site of a clinically relevant antibody can be derived starting from different struc- 
tural templates and employing the different met hods based on structural homology and conformational search. 

15 By pairwise combination of two different templates with two different methods, four model structures can be 
derived in an independent way and then compared by superposition of structural invariants. This comparison 
allows for the determination of how well the independently derived structures agree and which parts of the 
models do not show satisfying agreement. In the absence of experimental structural data, such comparative 
model building exercises presently provide the only way to assess the confidence level of antibody models. If 

20 the independently derived structures agree well, a high confidence level can be assigned to the model and a 
"consensus model" can be prepared. The consensus model would then typically represent a combination of 
structural elements derived by structural homology and conformational search. On the other hand, disagree- 
ment of the models allows for the identification of the particular critical regions in model structures which are 
less well defined and need to be improved or, if this is not possible, treated with caution. Such knowledge, 

25 obtained by comparative model building, is very important for the use of model structure for experimental de- 
sign. 

Although the humanized 60.3 Ab was prepared by grafting the murine CDRs onto the human frameworks, 
certain amino acids were not changed from the murine protein sequence to their human counterpart (due to 
their importance in retaining the conformation of the CDR loops). Therefore, three humanized 60.3 L "mutants" 
30 (based on computer modeling) were constructed in an effort to 1) further reduce its divergence and 2) deter- 
mine the contribution of these amino acids on antigen (Ag) binding. 

There are 4 amino acids in the Lchain which are changed: one is in the CDR2 (postulated not to be involved 
in binding) and the remaining amino acids all reside in the framework 2 region. The three humanized 60.3 L 
"mutants" are as follows: 
35 1) Mutant 1: amino acid change is only at position 50, from Arg (R) to Asp (D). 

2) Mutant 2 : 3 amino acids changes are carried out at positions 50 (as in Mutant 1), 54 [from Leu (L) to 
Arg (R)] and 55 [from Glu (E) to Ala (A)]. 

3) Mutant 3: contains the changes in Mutant 2, and at position 68 Arg (R) is changed to Gly (G). 

From this set of mutants, effects can be seen on binding by (1) Arg 68 alone (by comparing results from 
40 Mutants 3 and 2) and (2) Leu 54 and Glu 55 (by comparing results from Mutants 2 and 1). 

Total cellular RNA was extracted from a 60.3 producing cell-line according to the method of Davis et al 
(1986). The first strand cDNAwas synthesized using a cDNA synthesis kit from InVitrogen and an oligo (dT) 
primer. The cDNA was then amplified by polyermase chain reaction (PCR) using degenerate primers (Larrick, 
etal.(1991) Scand. J.. Immun. 32 :121-128; Colloma and Larrick (1991) Biotechniques 11 : 152-156). For the 
45 heavy chain, the sense primers (MH-SP-ALT.1 and MH-SP-SLT.2) were from the signal peptide and had an 
Xho I restriction site and the 5' end. The antisense primer (MH- gamma-CONST) was a consensus sequence 
from the CH1 domain of murine y heavy chains and had a Pst I site at the 3' end. 

For the PCR amplification of the light chain, the sense primers (EcoRI/FR1-ML (k)) were from the 5' end 
of the first framework region and had an Xho I restriction site at the 5' end. The antisense primer (Hindlll/M1(k- 
50 )CONST) was from the k constant region, but had a Sal I restriction site instead of the Hind III site described 
by Larrick et al . 

For both the heavy and light chains, the PCR products were either restricted (Xho I and Pst I for the heavy 
chain; Sal I and Xho I for the light chain) and cloned into similarly restricted pUC 18 or treated with nucleotide 
kinase followed by blunt ended ligation into Sma I digested pUC 18. Ligation products were used to transform 
55 competent DHa E. coli cells. 

Clones containing inserts were selected using X-Gal/PTG; positive clones were screened for appropriately 
sized EcoR l-Sal I restriction fragments. EcoR I and Sal I flank the cloning sites in pUC 18 and are therefore 
expected to release the PCR product producing an approximately 0.5kh fragment. Selected-clones were sub- 
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jected to the double stranded DNA sequencing (Hsiao (1991) Nucl. Acid Res 19 :2787) using Sequenase 
(U.S. Biochemical). The sequence is shown in Figures 10 and 12, and Sequence I.D. numbers 9 and 10. The 
V gene sequences were compared to sequences of other murine Ig genes (Kabat et al , (1987) Sequences of 
Proteins of Immunological Interest, 4th ed., Nat. Inst, of Health, Bethesda, MD.) The heavy chain was found 

5 to belong to the VH Ma subgroup and the light chain to belong to the Vk 1Mb subgroup. 

In order to be sure that the correct V genes had been cloned and sequenced, heavy and light chain from 
purified 60.3 were subjected to N-terminal amino acid sequencing. The amino acid sequence of the heavy chain 
was identical to that deduced from the DNA sequence. However, there was a discrepancy for the 7th and 8th 
amino acids of the L chain. For the DNA, these amino acids (Tyr and Gin, respectively) are encoded by the 

10 sense primer used for the PCR reaction. Ser and Pro were found at these positions by amino acid sequencing. 
Furthermore, almost all other Vk genes of this subgroup had Ser and Pro at this position. It was concluded 
that the primers used were not entirely appropriate for the V gene. They were however, similar enough to the 
cDNA that annealing and priming could occur. The codons for these 4 amino acids occur at the 3' end of the 
primer and are as follows: 

is Tyr Gin : TA (C/T) CA (A/G) 

Ser Pro : TCX CCX 

To determine the real sequence at this position, the entire procedure (cDN A synthesis, PCR amplifying, 
cloning, and sequencing) was repeated using primers which terminated before the codons in question. This 
20 showed that Ser and Pro were encoded at positions 7 and 8, respectively. The initial PCR product was reP- 
CRed, using a sense primer which encoded Ser and Pro rather than Tyr and Gin at positions 7 and 8. 

For each V gene, 2 PCR primers were synthesized. The amplification of genes by PCR, cloning into Pud 8, 
as well as the double stranded DNA sequencing were all done as described above. The PCRed V genes were 
then cloned into the expression vectors PNy 1.16 and PGk11 which have human constant regions (Fig. 9 and 
25 11) before transfection into the mouse myeloma cell line, Ag8.653. 

The VL and VH genes were inserted into pGk.11 and pNy1.16, respectively, by amplifying the genes by 
PCR adding restriction site and intron sequences at the 5' and 3' ends. 

The sense primers for both the H and L chains contained within their sequences the following in the 5' - 
3' direction: 
30 1 . N6, 

2. restriction site for cloning (EcoRI and Sacl for the H chain; EcoRI and Hindlll for the L chain; EcoRI is 
for cloning into pUC18, Sacl and Hindm are for cloning into the expression vectors), 

3. branch point lariat signal, 

4. polypyrimidine tract, 
35 5. splice acceptor site, 

6. leader peptide 2, 

7. the beginning of the FR1 of the V gene. 

The reverse complement of the antisense primer contained: 
1 . 3' end of VDJ gene for the H chain and of the VJ gene for the L chain, 
40 2. splice donor signal, 

3. restriction site for cloning (Sail and Xhol for both H and L chains; Sail is for cloning into pUC 18, Xhol 
is for cloning into the expression vectors), 

4. N6. 

The sense primer for the heavy chain was: 

45 

S'a^GAATTCGAGCTCTTITrCTGATAA 
GTCCAGTGTCAGGTCCAACTTCAGCAGCCTGGG3' 



so The anti-sense primer for the heavy chain was: 

5 ! A^GTCGACCTCGAGTGTGAGGACTCACCTGAGGAGACGGTGACTGAGGT 
GCCT3' 

The sense primer for the light chain was: 

55 

S'agGAATTCAAGCl'lUCCTGACTACATGAGTGCATTTCTGll'll'ATTTCCA 
ATTTC AG ATACC ACCGG AG AC ATTGTGCTAAC AC AATCTCCA3 ' . 
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The anti-sense primer for the light chain was: 

S'A^GTCGACCTCGAGATCACTTACGTTTGATTTCCAGCTTGGTGCCTCCAC 



The amplification of genes by PCR, cloning into pUC18 and the double stranded DNA sequencing were 
done as described above. The PCRed V genes were the cloned into the expression vectors. For the heavy 
chain, the V gene was directionally cloned into the Sac I and Xho sites of pNy1.16. For the light chain, the V 
10 gene was directionally cloned into the Hindlll and Xho I sites of pGk.11. 

Humanized VH and VL genes were constructed by oligos and is described in detail herein below. The in- 
sertion of humanized V H and V L genes into pNy1.16 and pGk.11, respectively were done by the procedures 
described for the chimeric Ab. The sequences needed for cloning and expression were built in (or included in ) 
in oligo #1 and #10. 

15 The presence of the chimeric 60.3 Abs were detected by ELISA. In this assay, 96 well plates were coated 

with goat anti-human IgG. The chimeric 60.3 in the sample which bound to the plates were detected by using 
horse radish peroxidase (HRPO) conjugated human anti-kappa IgG. Purified chimeric L6 (an unrelated anti- 
tumor antibody) was used as a standard. The culture supernatants from Ab producing clones were selected 
for binding in FACS binding assays. 

20 FACS Binding Assays 

HL60 is a human myelomonocytic cell line which expresses CD 18 on the cell surface. 
HL60 target cells, grown in culture media (DM EM, 1% fetal calf serum and 2% L-Glutamine), were made 10 7 /ml 
in the culture containing 0.1% NaAzide at 4°C. 1 0 6 cells were used per binding assay. Bovine IgG (Sigma, 20 
ug/ml final concentration) was added for 1 0 min. at4°C to mask the Fc receptors before performing the assays. 

25 Binding assays were done as direct binding, pre- incubation or competition assays. 

1) Standard binding assays 

Various amounts of murine, chimeric or humanized 60.3 were incubated for 45 minutes with HL60 cells, 
washed and then incubated with FITC-labeled goat anti-mouse (for murine 60.3) antibody (Tago, Inc.) at 
1 :50 dilution or FITC-labeled goat anti-human antibody (for chimeric and humanized 60.3.) The fluores- 
30 cence intensity was then monitored. 

2) Pre-incubation assays 

Pre- incubation experiments were done with murine 60.3. For pre-incubation with either chimeric 60.3, or 
humanized 60.3, 1000 ng/ml of either c60.3 or h60.3 was added to the HL60 cells for 45 minutes to 1 hr 
at 4°C. Cells were then washed twice with 1 ml of ice cold NaAzide-containing culture media and centrif- 
35 uged in a Beckman table top centrifuge (Model TJ-6) at 1000 rpm for 5 min. After decanting washes, cells 

were resuspended in various concentrations of FITC-conjugated murine 60.3 Abs and incubated on ice for 
45 min. to 1 hr. with intermittent mixing at 10min intervals. Cell pellets were fixed in 300 ul of 1 % paraf- 
ormaldehyde at 4°C before analysis of FACS machines. 

3) Co-incubation assays 

40 Co-incubation assays were carried out by incubating various concentrations of chimeric 60.3 or humanized 

60.3 with a saturating concentration (1 ug) of FITC-conjugated murine 60.3 Abs. Both types of Abs were 
added to the HL60 cells simultaneously; thus eliminating one incubation step. Incubation, washing and cen- 
trifugation were all done as in the pre-incubation assays. The cells were also suspended in 1 % ice cold 
paraformaldehyde before analysis by FACS. 
45 The mean channel values obtained from FACS analyses were translated into linear fluorescence equiva- 

lence (LFE)values. The LFE values were further used to calculate the fluorescence intensity levels (FIL) ac- 
cording to the following equation: 

FIL = LFE of sample/LFE of negative control (no second antibody) 
Electroporation of cells was carried out on a BioRad electroporator, set on capacitance of about 960 fu 
so and .25 volts. A count of viable cells was taken before starting. Cells were at least 90% viable for use. Cells 
were also in the 4-6 x 10 5 /ml range; if they are overgrown, they will not show high transfection efficiency. 1 
x 10 7 cells/electroporation group were removed and one group was used for a control electroporation and one 
for an "unzapped" control. Cells were centrifuged at 1000 rpm for 10 minutes. Supernatant was removed by 
vacuum with a sterile, unplugged pasteur pipet and the pellet was resuspended in as large a volume of PBS 
55 as the tube will allow and washed again. 

The pellet was resuspended in 0.8 ml PBS per 1 x 10 7 cells and 0.8 ml aliquot were added into labeled 
cuvettes. 10 ug each of DNA was added to cuvette. Incubate cuvette on ice for 10 minutes after mixing. The 
cuvettes were electro porated noting time factor reading and put on ice for 10 minutes, and then transferred 
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into 19 mis of IMDM/10% FBS, using 1 ml of medium to wash them out of the cuvettes. Cells were at 37° C 
for 48 hours, and then plated at 10 4 , 3 x 10 3 , and 1 x 10 3 cells per ml in IMDM with 10% FBS and fed for 2-3 
weeks before screening. 

Enzyme-linked immunosorption assays (ELISA) were carried out as is known in the art. In an illustrative 
5 embodiment of ELISA in the present invention was performed as follows. Plates were created by diluting goat 
anti-human IgG 1:10,000 with 0.05M carbonate buffer, pH9.6 and transferring 100 ul to each well of a 96 well 
microtiter plate . The plate was then incubated at 4°C for 12 to 16 hours. The plates were then rinsed 1 to 3x 
and specimen diluent was added. The plates were then incubated at room temperature for 1 hr and rinsed 3x. 
On separate plates, 30 ul of cultural supernatant was diluted to 300 ul with the specimen diluent and 50 ul was 
10 transferred to the previously coated plates. 50 ul of specimen diluent was added and maintained at room temp 
for 1 to 2 hours and rinsed 3x. HRPO- conjugated human anti kappa was diluted with conjugate diluent 1:5,000 
and 100 ul was added per well to plates and incubated at 37°C for 30 minutes and washed 3x. Achromagen 
(1 :300) with buffer substrate, pH 5.5 (room temperature) and 100ul was added to the plates. The plates were 
incubated at room temperature for 15 minutes, 100 ul of 3M H 2 S0 4 was added and the plates were read at a 
15 wavelength of 450 nm and 630 nm. 

Chimeric and humanized Mab 60.3 were analyzed using size exclusion HPLC (secHPLC), sodium dode- 
cylsulfate-polyacrylamidegel electrophoresis (SDS-PAGE), and isoelectric focusing (IEF). Test samples were 
compared to a murine 60.3. 

20 secHPLC 

ATSK3000SW Spherogel 7.5 x 600 mm column manufactured by Toso Haas was used. The mobile phase 
was 0.05 M phosphate buffered saline and samples were eluted at 0.5 ml/minute for 60 minutes. The chroma- 
tograms show that the test lots have multiple contaminating peaks and a major peak eluting at about 27.5 min- 
25 utes. The major peak observed with the reference murine 60.3 Mac chromatogram, eluted at about 26.7 min- 
utes. 

SDS-PAGE 

30 SDS-PAGE was performed using a 4-20% gradient gel and bands were detected by Coomassie blue stain- 

ing. All lots were compared to the murine reference lot. Samples were run both reduced with 2-mercaptoetha- 
nol and non-reduced. Non-reduced gels showed that the major band ran consistent with the reference stan- 
dard. The reduced gel showed data consistent with heavy and light chain separation. 

35 IEF 

Isoelectric focusing was performed using precast gels with a pH range of 3-1 0. Both 2 ug and 5ug of sample 
were applied and run at 100 volts for 60 minutes, 200 volts for 60 minutes, and 500 volts for 30 minutes. Gels 
were stained with Coomassie Blue R-250. The chimeric 60.3 sample had no visible bands. This suggests that 
40 the material did not migrate into the gel. The humanized 60.3 sample had visible bands at the top of the gel 
in a region difficult to determine pi's. This data suggest that the chimeric and humanized lots of antibody have 
pi's greater than 8.0 as determined by the reference markers. This is in contrast to the murine mAb 60.3 which 
has a pi range of 6.8 - 7.5. 

Having described this invention and embodiments thereof, the present invention is further illustrated by 
45 the following Examples which are not intended to limit the scope of the invention. 

EXAMPLE 1: Comparative Model Building 

The structural templates for comparative model building of murine 60.3 were determined. The Brookhaven 
so Protein Data Bank was searched for the sequences with known structure which are most homologous to murine 
60.3 heavy and light chains. The antiphosphocholine murine myeloma antibody, McPC603 (Satowetal. (1986) 
J. Mol. Biol. 190 : 593-604) was found to be the most homologous to the light chain, with 68% sequence iden- 
tity/homology. The anti-p-azophenylarsonate murine monoclonal lgG2, R19.9 (Lascombe et al (1989) Proc. 
Natl. Acad. Sci. USA 86 : 607-611), was found to be the most homologous to the heavy chain, with 59% se- 
55 quence identity/ homology. MCPC603 VL and R1 9.9 VH were combined in order to obtain the structural tem- 
plate for model building of murine and humanized 60.3. 

The CDR loops and known structural determinants of murine 60.3 were determined. Three CDR loops in 
60.3 can be directly assigned to known canonical types (Chothia et al (1989) Nature 342 : 377-383). These 
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are L2 (type 1), L3 (type 1) and H1 (type 1). The remaining CDR loops do not belong to known canonical struc- 
ture types. The boundaries of these loops can be determined by aligning the 60.3 sequence with that of the 
structural model. These assignments are shown along with those of L2, L3 and H1 in Figs. 1 and 2. The fra- 
mework residues which are crucial for the conformations of the CDR loops are also shown in Figs. 1 and 2 
5 ( for example, * L1 indicates structural determinants for the L1 loop) and in Sequence I.D. Numbers 2, 4, 6, 
and 8. 

A model of murine 60.3 was then built. Backbone loop templates for L2 and L3 were taken from McPC603, 
and H1 from R19.9. L1 was initially modeled by a two residue deletion of the L1 loop of McPC603. H2 was 
found to be similar to the corresponding loop region in R19.9 , which may represent a not yet classified can- 
to onical motif. 

No loop closely related to H3 was found in the Brookhaven database. As a initial approximation for H3, an 
antibody non-CDR loop of the same size as 60.3 H3 (defined here as 96 to 1 02 in the Kabat numbering scheme 
or as H99 - H1 09 in the continuous sequence) was used. The backbone template for this loop was a 1 3 residue 
segment of the antibody NEW (Polijak RE. et al. (1974) Proc. Natl. Acad. Sci. USA 71 : 3340), beginning at 

15 residue L8. The loop was selected because it has the same length and showed a reasonable fit into the ad- 
jacent framework of H3. Energy minimization/conformation refinement of murine 60.3 resulted in: 1) residual 
rms derivatives of the energy function : 0.63 kcal/mol A 2) Backbone rms deviations from the initial crystal 
coordinates: V L : 0.86 A ; V H :1 .17 A. 

The most homologous human variable region sequences were found. The human sequence most homo- 

20 logous to 60.3 V L is PIR Accession # A01900 (sequence identity/homology 66%). This is the Vg germline se- 
quence described by Pech and Zachau (Pech, M. and Zachau, H.G. (1 984) Nucleic Acids Res. 12 : 9229-9236). 
Vg belongs to human Vk subgroup Ilia. 

The human sequence most homologous to 60.3 V H is PIR Accession # A32483 (homology 59%). This is 
the heavy chain from human monoclonal Ab MO30 (anti-HIV gp 120) (Larrick et al. (1989) BBRC 160 1250- 

25 1256). There are two germline sequences highly homologous (1 aa mismatch through FR3) to MO30 : 21-2 
and 3-1 (Berman et al (1988) EMBO J. 7: 727-738). These sequences belong to human VH subgroup 1. 

The structural template for murine 60.3 was confirmed to be suitable for the human template. The human 
template for the heavy chain (MO30) is 56% homologous to R19.9. The human template for the light chain (Vg) 
is 62% homologous to McPC603. These numbers are similar to the homology between murine 60.3 and the 

30 same structural templates. The CDR loops and the structural determinants for the human template are shown 
in Figs. 1 and 2. The CDR loops and known structural determinants were then grafted onto the human template 
(Jones et al. (1986) Nature 321 : 522-535). The results are shown in Figures 1 and 2 in the column marked 
Phase I h60.3. In these figures, the h60.3 sequences and all identical sequences from the other columns are 
shaded. 

35 A Phase 1 model of humanized 60.3 was then built using the same model building protocol as for murine 

60.3. The backbone CDR loop templates were the same as for m60.3. Energy minimization/conformational re- 
finement of murine 60.3 and Phase I h60.3 resulted in: 
residual rms derivatives of the energy function: 
m60.3 : 0.63 kcal/molA 

40 Phase 1h60.3 : 0.58 kcal/molA 

Backbone rms deviations from the initial crystal coordinates: 
m60.3 model V L : 0.86 A; V H : 1.17 A 

Phase I h60.3 V L : 0.96 A; V H : 1.15 A 

In Phase II, the murine and Phase I h60.3 models were compared and refined. Acomparison of the models 

45 of murine and humanized 60.3 gave an rms deviation of 0.66 A and 0.78 A for V L and V H , respectively. As a 
result of the comparison, it was postulated that certain residues which were still "human" were important for 
the conformation of the CDR loops. These were therefore changed to the murine residues, and are shown in 
bold in the column marked Phase \U\\\ h60.3 in Figures 1 and 2. Sequence comparisons of m60.3 and Phase 
II 60.3 models (including the CDR loops) gave the following homologies: 



so Phase II 60.3 V L : 

vs.murine 60.3 V L 82% 

vs. human template V L 82% 

Phase II h60.3 V H 

vs. murine 60.3 V H 78% 

55 vs. human template V H 81 % 
Phase II h60.3 F v 

vs. murine 60.3 F v 80% 

vs. human template F v 81 % 
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In Phase III further improvements of the structural model of h60.3 were made. After further refinement of 
the murine and humanized models, the following parameters were obtained: 

Final rms derivatives of the energy function: 
murine 60.3 model: 0.55 kcal/mol A 
5 Phase III h60.3: 0.78 kcal/mol A 

Backbone rms deviations from the initial crystal coordinates: 
murine V L : 0.82 A, V H : 1 .09 A F v : 0.98 A 

Phase III h60.3 V L : 0.88 A, V H : 1.07 A F v : 1.00 A 

Finally, in Phase IV, the binding site features of the Phase III h60.3 model were analyzed. When comparing 
10 murine vs. humanized models at previous stages, the emphasis was more on the comparisons of the CDR - 
framework interactions in the murine antibody and the humanized contacts. The Phase IV model, where the 
L1 and H3 loops were remodeled using conformational search, allows a more detailed analysis of the CDR 
surface than the previous models. 

Analysis of the model suggests that 60.3 is a distinct groove-type antibody and that certain CDR loops 
15 (L2 and H1) may not be involved in antigen binding. In the Phase IV model, the amino acids in these loops 
have been changed to the sequences from the human template. 

EXAMPLE 2 

20 CONSTRUCTION OF THE HUMANIZED 60.3 V H AND V L GENES 

Construction of the humanized 60.3 V genes was done according to the modeling methodology described 
in Example 1 by piecing 5 pairs of complementary oligos together (see Figures 3 and 4 and Sequence I.D. 
Numbers 1 1 to 30) each oligo was about 90 to 1 00 nt in length, together they span the entire V regions, forming 
25 the V H and V L The step by step protocol used was as follows. 

1 . Five microfuge tubes were labeled and the following oligos (10 ng each, 10 ul) were added: 

tube 1: oligo #2 
tube 2: oligo #3 and #4 
tube 3 : oligo #5 and #6 
30 tube 4: oligo #7 and #8 

tube 5 : oligo #9 

2. Tubes 2, 3 and 4 were heated to 100°C for 3 min. and then cooled slowly to room temp. 

3. To all 5 tubes, 10x kinase buffer (40 m M Tris. CI pH7.5, 10 mM MgCI 2 , 10mMDTT, lug DNA, 0.5 mMATP, 
50 ug/ml bovine serum algumin and 1 "weiss" unit T4 DNAIigase), 100 uM ATP (1 ul) and nucleotide kinase 

35 were added to phosphorylate the 5' ends of the oligos. One Weiss unit is equivalent to 60 cohesive-end 

units. Reactions proceeded at 37° for 1 hr. 

4. The tubes were extracted with phenol/CHCI 3 and precipitated with ethanol. 

5. Oligos #1 and #10 (10ng, 10 ul) were added to tubes 1 and 5, respectively. Tubes 1 and 5 were mixed 
and step 2 was repeated for tubes 1 and 5. 

40 6. The contents from all 5 tubes were pooled into a single tube. 

7. Ligate at 12° C, for 12 to 16 hours in a vol. of 25 ul. 

8. Analyze 2 ul on 1.0 % agarose. 

9. After successful ligation, restriction digest an aliquot with EcoRI and Sail for 45 min. at 37°C. 

1 0. Apply onto 3.0% low melt agarose gel and cut out the correct sized band (approximately 0.4kb). 
45 11. Ligate into Puc 18 (pre-digested with EcoRI and Sail). 

12. Transform DH5a cells. 

13. Select the potential positives based on Xgal/IPTG indicators. 

14 Miniprep cultures were prepared and maintained for 12 to 16 hours. 

1 5. Plasmid DNA was isolated from these cultures and the insert sizes were checked by cutting with EcoRI 
so and Sail. 

16. The plasmid DNA was sequenced for verification. Several of the clones had mutations such as single 
base deletions. 

1 7. The synthetic H and L variable genes were cloned into appropriate expression vectors (pNG1 .16 and 
pGk. 11, respectively.) 

55 1 8. The potential positives were isolated after selection on ampicillin. 

1 9. Steps 14 to 16 were repeated. 

20. Transfection into mouse Ag8.653 myeloma cells was carried out, followed by selection with G41 8 (Raff, 
etal., (1991) J. Infect. Pis. 163 : 346-354). 
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21. Ig positives were then screened with ELISA (gamma, kappa capture). The DNA sequences of the mur- 
ine 60.3 heavy and light chains are shown in Sequence I.D. numbers 5 and 7, respectively; and the DNA 
sequences for the humanized 60.3 H and L chains are shown in Sequence I.D. numbers 1 and 3, respec- 
tively. 

5 

EXAMPLE 3 

BINDING ASSAYS OF 60.3 

10 The binding activity of the humanized 60.3 antibody was measured by preincubation, competition and 

chemilluminescence assays. 

HL60 is a human myelomonocytic cell line which expresses CD 18 on the cell surface. 
HL60 target cells, grown in culture media (DM EM, 1% fetal calf serum and 2% L-Glutamine), were made 10 7 /ml 
in the culture media containing 0.1% NaAzide at4°C. 10 6 cells were used per binding assay. Bovine IgG (Sigma, 
15 20 ug/ml final concentration) was added for 1 0 minutes at4°C to mask the Fc receptors before performing the 
assays. Three types of binding assays were performed. 

A) Standard curves 

20 As illustrated in Figure 5, various amounts of murine, chimeric, and humanized 60.3 were incubated for 45 

minutes with HL60 cells. The cells were washed and then incubated for 45 minutes with either -FITC conju- 
gated goat anti mouse IgG (for m60.3) or FITC conjugated goat anti human IgG (for c60.3 and h60.3). Excess 
antibody was washed off and the cells were fixed with 1 % paraformaldehyde and assayed by FACS. For each 
curve, the value obtained at 1500 ng/ml is taken as 100%. Data at other concentrations are plotted as % of 

25 this value. As can be seen in Figure 5, all three antibodies titrate over approximately the same range., indicating 
similar affinities (known to be about 10 9 for m60.3). 

B) Preincubation experiments 

30 HL60 cells were preincubated for 45 minutes with 1 jag/ml of either c60.3 or h60.3. The indicated amount 

of FITC conjugated m60.3 (or no antibody) was then added and the cells were incubated for another 45 min- 
utes. After washing, the cells were fixed with paraformaldehyde and assayed by FACS. As can be seen in Fig- 
ure 6, both the chimeric and humanized antibodies were able to completely block the binding of FITC - m60.3 
to HL60 cells (note FIL = 1 is equivalent to no binding). 

35 

C) Competition experiments 

As illustrated in Figure 7, 1 ^g of FITC conjugated m60.3and the indicated amount of either c60.3 or h60.3 
were coincubated with HL60 cells for 45 minutes. The cells were washed, fixed with paraformaldehyde and 
40 assayed by FACS. As can be seen in Figure 7, the chimeric and humanized antibody competed equivalently 
in this assay. The dashed line shows binding of 1 ^g FITC - m60.3 in absence of competitor. 

3) Chemilluminescence Assay 

45 An assessment of activity of 60.3 by inhibition of Zymosan- induced, luminol-enhancedchemiluminescence 

of PMN was carried out. The material and compositions and procedures used were as follows. 

Preparation of Components: 

so 1.GGVB 

1.1. Materials 

a) 5x Veronal: Dissolve 41 .2g NaCI and 5.095g 5, 5-diethyl barbiturate (Paragon B-2 buffer) in 700 ml deop- 
55 mozed(diH20). Adjust pH to 7.35 + 0.05 with 1N HCI. Bring volume to 1 liter with diH20. Filter sterilize and 

store at 4°C. Stability but is at least 2 months. 

b) Stock metals: mix equal volumes of 2M MgCI 2 (40.66g/100ml) and 0.3M CaCI 2 (4.4g/1 00ml). Filter ster- 
ilize and store at 4°C. Stability is at least 6 months. 
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c) gelatin 

d) dextrose 

1.2. Procedure 

5 

For 300 ml GGVB: 

Add 0.3g gelatin and 0.3g dextrose to 240 ml diH20. Heat with mixing until just dissolved. Let cool to 
below 37C. 

Add 60 ml 5x Veronal and 0.3 ml stock metals. Filter sterilize and store at 4C. 
10 This is made up fresh for each day's assays. 

2.2. Zymosan 

2.1. Materials: 

15 

a) Zymosan A, SigmaZ-4250 

b) diH20 

c) 1 x PBS 

20 2.2. Procedure: 

Suspend zymosan to 25 mg/ml (1 .5g/60ml) in diH 2 0. Heat in glass in a water bath at 100C for 60 min. 
Transfer to 50 ml polypropylene centrifuge tubes. Centrifuge (8500 rpm in the TJ-6, 10 min, RT) and 
wash twice with 1 x PBS. Resuspend to 50 mg/ml (1.5g/30ml) in 1 x PBS. Store at 4C. Stability is at least 1- 
25 2 months at 4C. 

3. Complement (adsorbed human serum) 

3.1. Material: 

30 

a) Freshly drawn human blood, without anticoagulants. Type O may be best, or use the same donor that 
supplies PMN. Transfer blood immediately after drawing into sterile 50ml glass centrifuge tubes (available 
from blood lab). 

b) 0.1 M EDTA, pH 7.35. Dissolve 3.72g NA 2 H 2 EDTA in 80 ml di H20, with mixing. Adjust pH to 7.35 ± 
35 0.05 with freshly prepared 2N NaOH. Bring to 1 00 ml volume with diH20. 

c) Zymosan prep (see 2.1) 

3.2. Procedure: 

40 Allow blood to clot at room temperature for 1 hour. Rim the tube with a glass pi pet to contract the clot. 

Centrifuge (8500 RPM in TJ-6, 20 min., RT) and carefully remove serum to a sterile polypropylene tube. Repeat 
centrifugation if necessary. 

Add 0. 1 M EDTA to serum to 1 0% mg/ml zymosan needed to adsorb serum at 0.2 mg zymosan/ml of serum. 
Add this volume of zymosan to each of four centrifuge tubes. Centrifuge zymosan tubes (8500 rpm in the TJ- 
45 6, 10 min,. RT) and remove supernatant. Keep the tubes on ice. 

Resuspend the zymosan pellet of one tube with a small amount of serum. Add the reaminder of the 
serum and mix by inversion. 

Incubate the serum/zymosan mixture on a rocker or rotator, 30 min, 4C. 
Centrifuge as above. Repeat the adsorption with the other three tubes. 
50 Afterthe final centrifugation, filter the serum through a 0.45um syringefilter Aliquot into microfuge tubes 

or equivalent and store at -70C. The adsorbed serum has an unknown stability but it probably good for at least 
six months. 

4 Human PMN 

55 

4.1. Materials 

a) Freshly drawn heparinized blood. 
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b) Ficoll-Hypaque Mono-Poly Resolving Medium (MPRM), Flow Labs no. 16-980-49 

c) 1 x PBS 

d) 3% HoAc in diH20 

e) Serum-free, phenol red-free RPMI with 0.5% gelatin (RPMI-gel): Add gelatin (1.25g/250 ml) to serum- 
5 free, phenol red-free RPMI and heat with stirring until gelatin is just dissolved. Filter sterilize (0.45 urn filter) 

and store in 50ml aliquots at4C. Stability is least 2 weeks. 

4.2. Procedure 

10 Aliquot 4 ml MPRM into 15 ml polypropylene centrifuge tubes. 

Overlay gently with 5 ml fresh blood. 

Centrifuge (30 min, 2000 rpm in TJ-6, RT). If RBC are not completely pelleted, centrifuge an additional 
10 minutes. If no RBC sedimentation has occurred, additional centrifugation will not work and another donor 
must be found. 

15 Aspirate top (plasma) layer and first layer of cells (monocytes). 

With a Pasteur pipet, carefully remove the second cell layer (PMN) to a 50ml centrifuge tube. Repeat 
with all tubes. 

Add RT at room temperature, 1 x PBS to the PMN tube to a final volume of 50 ml and mix gently. 

Remove 50ul and dilute appropriately in 3% HoAc (usually 1:20 is appropriate for a prep of 30ml whole 
20 blood) and count PMN in a hemocytometer. Calculate total PMN present in the 50ml tube. 

Centrifuge PMN (10 min, 2000 rpm in TJ-6, RT) and resuspend to desired density (normally 2 x 10 6 ml) 
in RPMI-gel. Store at RT, swirling gently to resuspend occasionally (once or twice an hour). Viability should 
be > 90% after 6 hours; if used after 6-8 hours, check viability by trypan blue exclusion prior to use. 

25 5 Luminol 

5.1. Materials 

a) Luminol (5 amino-2,3 dihydro - 1 ,4 - phthalazinedione), Sigma no. A-8511. 
30 FW= 172.2 

b) DMSO, chromatography grade 

5.2. Procedure 

35 Dissolve luminol to 10" 2 M in DMSO (17.7 mg/10ml). Store at 4C in the dark (wrap tube in foil). Stability 

is more than 1 month; make fresh 3-4 weeks. 

6. Antibody 

40 6.1. Materials and Procedure 

Dilute antibody in GGVB to appropriate concentrations. 
Assay Design 

45 

Test volumes per tube are normally 1 ml, composed of the following. 

- 100 ul antibody (sensitivity to concentration is unknown, previous work has been with 5 ug/ml preps) 

- 100 ul PMN (normal concentration at 2 x 10 6 /ml) 

- mix by swirling gently and incubate 30 min, at room temperature (RT) (coverall tubes with one sheet of 
so Parafilm) 

- 100 u I Zymosan (high concentrations, around 50mg/ml,seem to work best) 

- just before loading luminometer, add: 

- 600 ul luminol (diluted in GGVB , normal concentrations are 10- 4 or 5 x 10" 5 M) 

- 100 ul complement (low concentrations, on the order of 1-2%, seem to work better) 

55 

Staging of Assay 

Set up CL tubes (Clinicon 2174-089, available through LKB). Keep in the dark (in a drawer) to prevent spon- 
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taneous luminescence from absorbed fluorescent light. 

Thaw complement at RT and hold on ice, just prior to assay complement may be unstable (may lost 50% of 
activity in 6-8 hours) at 4C. 

Add antibody, PMN, and zymosan as described above to CL tubes. 
5 Take tubes to luminometer. Program assay parameter into controlling computer. 

Prepare final dilution of Luminol (and keep wrapped in foil) just before adding to CL tubes. 
Prepare final dilution of complement in GGVB. 

Add luminol and complement to CL tubes as described above and load into luminometer. Start the program 
immediately. Peak luminescence is reached 4-5 minutes after adding complement. 
10 Throughout assay setup and during t he first rotation of the CL tubes in the luminometer, apply anti-static charge 
with the anti-static gun to prevent CL tubes hanging up in the luminometer. 

Results 

15 The above procedure was utilized to measure the ability of murine, chimeric, and humanized 60.3 to inhibit 
CR3(CDIIb/CD18) medicated up take of opsonized zymosan by neutrophils. In the absence of 60.3. phagocytic 
uptake of opsonized zymosan results in an increase in hexase monophosphate shunt activity that is measured 
as light output by luminol-enhanced chemiluminescence. 
% inhibition of chemiluminescence signal = 100 x(1 - signal of sample/signal of negative control anti- 

20 body) 

The results shown in Figure 8 illustrate that all of the 60.3 antibodies were reactive in this assay, while an ir- 
relevant antibody (murine L6 anti-tumor antibody) showed no reactivity. 

EXAMPLE 4 

25 

CONSTRUCTION OF pGK.11 

In order to express either chimeric or humanized light chains, cassette vectors were constructed capable 
of expressing variable region genes, synthesized using PCR (polymerase chain reaction). In addition to se- 
30 quences found on pSV2-gpt, these vectors contain an Ig promoter and leader, the 5' portion of an intron, a 
MCS for insertion of the variable region gene, the mouse heavy chain enhancer (MHE), the 3' portion of an 
intron and the human Ck gene. The variable region was PCRed so that in addition to the coding region it con- 
tained flanking intron sequences. 

The cassette for the expression of the light chain was constructed as follows (and is illustrated in Figure 

35 9): 

1 . The 121 bp Hind III to Bgl II fragment of pSV2-gpt (Mulligan and Berg (1980) Science 209 : 1422) was 
deleted from the 5' end of the Ecogpt by restriction with Hind III and Bgl II, filling in with Klenow polymerase, 
and religation. The product, pG.2, was detected by the absence of Hind III and Bgl II sites and linearization 
with EcoRI. 

40 2. The pBR322 part of pG.2 (EcoR l-Pvu II fragment) was replaced with the analogous portion of pUC 1 8 

(Pvu II- Pvu II fragment) to form pG. 3 This product was screened for by digestion with Pstl, which linearizes 
pG.3 giving a 5.05 kb fragment. 

3. pG.3 was made into pG.5 by the replacement of the 750 bp EcoR I to BamH I fragment with the 64 bp 
EcoR I to BamH I multiple cloning site (MCS) from plC20R (Marsh etal. 1984). pG.5 was screened for the 

45 presence of an Xho I site (part of the MCS). 

4. A Not I site was inserted in the Nde I site of pG.5 to form pG.12. Oligonucleotide linkers (Sequence I.D. 
numbers 33 and 34) were used for this purpose. pG.12 was screened for the presence of a Not I site. 

5. A 2.75 kb EcoR I fragment containing the human Ck gene was inserted into the EcoR I site of the MCS 
of pG.12 to form pGk.3. When the Ck fragment was in the correct orientation, Sac I digestion produced 

50 0.126, 0.509 and 6.7 kb fragments (vs. 0.509, 2.1 and 4.7 kb fragments in the wrong orientation). 

6. A 140 bp portion of the SV140 enhancer was removed from pG.3 by restriction with Sph I, destroying 
the overhang with the exonuclease activity in Klenow polymerase, followed by digestion with Pvu II and 
blunt end ligation. The product, pG.9, was screened for the loss of Pvu II, Nsi I and Sph I sites. 

7. A Not I site was inserted in the Nde I site of pG.9 to form pG.10. Oligonucleotide linkers, described in 
55 #4 above, were used for this purpose. pG.10 was screened for the presence of a Not I site. 

8. The 195 bp Not I to BamH I fragment from pG.12 was inserted into the Not I to BamH I site of pG.10 to 
form pG.11. This served to place an 879bp fragment from pG.10 with a 195 base pair fragment containing 
a MCS. pG.11 was screened for the presence of Xho I site in the MCS region. 
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9. The 3kb Nar I to Cla I fragment from pGk.3 was directionally subcloned into the same sites of pG.11 to 
form pGk.4. 

1 0. A 1 kb fragment containing the mouse heavy chain enhancer was transferred from pICMHEXX to pGk.4 
as a Cla I to Hind III fragment, thus forming pGk.5. EcoR I digestion of pGk.5. produced 0.3, 2.75 and 5.1 

5 kb fragments. pICMHEXX was made by the insertion of the 1kb Xba I fragment (filled in with Klenow poly- 

merase) from RBL 216 (Lang et al (1982) Nucl. Acid Res. 10: 611-620) into the filled in Bgl II site of pIC 
1 9 R (Marsh etal (1984) Gene 32: 481-486). 

1 1 . A 579 bp Sau3a I fragment containing the 4B9 promoter pGkA1 .9 (Raff et al, 1991) was inserted into the 
BamH I site (in the MCS) of pGk.5 to form pGk.11. The resulting plasmids were screened for the correct ori- 

10 entation of the insert : BamH I plus Asp718 I digestion gave a 1 .2 kb fragment in the correct orientation vs. a 

2.2 kb fragment in the wrong orientation. Also, BamH I plus Hind III digestion should give a 0.58 kb fragment. 
The sequence of specific regions is contained in the following segments. The sequence is given in clock- 
wise orientation beginning at the EcoR I site at 0° and is illustrated in Figure 10 and Sequence I.D. 10. 
The ampicillin resistance gene is bp 7383 to 8241 
15 The ecogpt gene is bp 5651 to 6107 

The mouse heavy chain enhancer is bp 2770 to 3788 

The Sau3a I fragment containing the Alk promoter and leader is bp 3827 to 4393, with the leader peptide en- 
coded by bp 3951 to 3999. 

The EcoR I fragment containing the human Ck gene is bp 6 to 2756, with the Ck region itself encoded by bp 
20 211 3 to 2435. 

EXAMPLE 5 

Construction of pNyl .16 

25 

In order to express either chimeric or humanized heavy chains, cassette vectors were constructed capable 
of expressing variable region genes synthesized using PCR (polymerase chain reaction). In addition to se- 
quences found on pSV2-neo, these vectors contain an Ig promoter and leader, the 5' portion of an intron, a 
MCS for insertion of the variable region gene, the moue heavy chain enhancer (MHE), the 3' portion of an 
30 intron and the human C y1 gene. The variable region gene is PCRed so that in addition to the coding region 
it contains flanking intron sequences. 

The cassette for the expression of the heavy chain was constructed as follows (as illustrated in Figure 11) 
1 . The Hind III site in pSV2-neo was removed by digestion with Hind III, fill in with Klenow polymerase and 
religation. The product, pN.1 was screened for the absence of the Hind III site. 
35 2. The 750 bp EcoR I to BamH I fragment from pN.1 was replaced by a 64 bp EcoR I to BamH I multiple 

cloning site (MCS) from plC20R to form pN.5 

3. A PCR region was done on p1 CMHEXX in a manner that primers were chosen to delete the EcoR I site 
while generating a new MCS region. This resulted in a product consisting of: a) recognition sequences for 
EcoR V, Asp 718 I, Sac I and Xho I; b) a 695 bp of the mouse heavy chain enhancer from the 5'Xba I site 

40 to the EcoR I site; and c) recognition sequences for Hind III, Sal I and BamH I. This 723 bp PCR product 

was cloned into p1C20R to form pMHE.per. 

4. The 723 EcoRV- BamH I fragment from pMHE.per was subcloned into the same sites in the MCS region 
of pN.5. This removed the previous MCS, while inserting the one associated with the MHE. The product, 
pN.8, was screened for the presence of 0.7 and 5.0 kb Xho I - Hind III fragments and for linearization to 

45 5.7 kb with EcoR I. 

5. Two PCR SOEing (Horton etal , (1990) Biotech. 8 : 528-535) reactions were used to create several mu- 
tations in the L6 heavy chain promoter. Outer primers had enzyme sites EcoR V and Sacl for subcloning 
into plC20R to form pMUTL6HCP. The sequence of this insert is shown as bp 7793-8495 in Fig. 12 and 
in Sequence I.D. number 9. 

50 6. The 703 bp EcoR V to Sacl fragment from pMUTL6HCP was inserted into the same sites of pN.8 to 

form pN.9. 

7. A3.5 kb Xho I - BamH I fragment from pNyl A2.5, containing the PCRed MHEXR (Sequence I.D. Number 
31) plus the 2.8 kb Hindm - BamH I fragment encoding the human y1 gene (Sequence I.D. Number 32) 
was inserted into the same sites of pN.9 to form pNy1.16. The sequence of this insert is shown as bp 2- 
55 2799 in Fig. 12 and in Sequence I.D. Number 9. 

The foregoing description and the Examples are intended as illustrative and are not to be taken as limiting. 

Still other variations within the spirit and scope of this invention are possible and will readily present themselves 

to those skilled in the art. 
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G 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
20 1 5 10 15 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asp Tyr 
20 25 30 

Trp Met Asn Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp Met 
25 35 40 45 

Gly Arg lie Asp Pro Ser Asp Ser Glu Thr His Tyr Asn Gin Lys Phe 
50 55 60 



361 



30 



35 



Gin Gly Arg Val Thr Met Thr Arg Asp Thr Ser Thr Ser Thr Val Tyr 
65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Arg Leu Gly Ser Phe Ala Met Asp Tyr Trp Gly Gin 
100 105 no 

Gly Thr Leu val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



45 



55 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAAATTGTGC TAACACAATC TCC AGCT AC A TTGTCTTTGT CTCCAGGTGA GAGAGCCACT 60 
50 CTAT CCTGC A GAGCCAGTGA AAGTGT TGAT AGTTATGGCA ATAGTTTTAT GCACTGGTAC 120 
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CAGCAGAAAC CAGGACAGGC ACCAAGGCTC CTCATCTATC GTGCATCCAA CCTAGAAACT 180 

GGTATCCCTG CCAGGTTCAG TGGCAGTGGT TCTAGGACAG ACTTCACTCT CACCTATTCT 240 

1Q TCTCTAGAGC CTGAAGATTT TGCAGTGTAT TACTGTCAGC AAAGTAATGA GGATCCTCGG 300 

ACGTTCGGTG GAGGCACCAA GG TGGAAG AG AAAC 334 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 111 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



20 



25 



30 



Glu He val Leu Thr Gin Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
1 5 10 15 

Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Glu Ser Val Asp Ser Tyr 
20 25 30 

Gly Asn Ser Phe Met His Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro 
35 40 45 

Arg Leu Leu He Tyr Arg Ala Ser Asn Leu Glu Thr Gly lie Pro Ala 
50 55 60 

Arg Phe Ser Gly Ser Gly Ser Arg Thr Asp Phe Thr Leu Thr Tyr Ser 
65 70 75 80 

Ser Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Ser Asn 
85 90 95 

Glu Asp Pro Arg Thr Phe Gly Gly Gly Thr Lys Val Glu Glu Lys 
100 105 HO 

40 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



35 



45 



55 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
50 CAGGTCCAAC TTCAGCAGCC TGGGCCTGAC CTTCTGAAGC CTGGGGCTCC AGTGAAGCTG 60 
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10 



15 



20 



25 



Gin Val Gin Leu Gin Gin Pro Gly Pro Asp Leu Leu Lys Pro Gly Ala 
1 5 10 15 

Pro Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asp Tyr 

30 2 0 2 5 3 0 

Trp Met Asn Trp Val Lys Gin Arg Pro Gly Arg Gly Leu Glu Trp He 
35 40 45 



35 



40 



Gly Arg He Asp Pro Ser Asp Ser Glu Thr His Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

lie Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Arg Leu Gly Ser Phe Ala Met Asp Tyr Trp Gly Gin 
100 105 110 

Gly Thr Ser Val Thr Val Ser Ser 
115 120 

45 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



50 



55 



(ii) MOLECULE TYPE: DNA (genomic) 



361 



TCCTGCAAGG CTTCTGGCTA CACCTTCACC GACTACTGGA TGAACTGGGT TAAGCAGAGG 120 

CCTGG AC G AG GCCTCGAGTG GATTGGAAGG ATTGATCCTT CCGATAGTGA AACTCACTAC 180 

AATCAGAAGT TCAAGGACAA GGCCACACTG ACTGTAGACA AATCCTCCAG CACAGCCTAC 240 

ATCCAACTCA GCAGCCTGAC ATCTGAGGAC TCTGCAGTCT ATTACTGTGC ACGAGGGGGA 300 

CGGCTCGGGT CCTTTGCTAT GGACTACTGG GGTCAAGGCA CCTCAGTCAC CGTCTCCTCA 360 
G 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

10 

GACATTGTGC TAACACAATC TCCAGCTTCT TTGGCTGTGT CTCTAGGGCA GAGGGCCACC 60 
ATATCCTGCA GAGCCAGTGA AAGTGTTGAT AGTTATGGCA ATAGTTTTAT GCACTGGTAC 120 
CAGCAGAAAC CAGGACAGCC ACCCAAACTC CTCATCTATC GTGCATCCAA CCTAGAATCT 180 
15 GGGATCCCTG CCAGGTTCAG TGGCAGTGGG TCTAGGACAG ACTTCACCCT CACCATTAAT 240 

CCTGTGGAGG CTGATGATGT TGCAACCTAT TACTGTCAGC AAAGTAATGA GGATCCTCGG 300 
ACGTTCGGTG GAGGCACCAA GCTGGAAATC AAAC 33 4 

(2) INFORMATION FOR SEQ ID NO : 8 : 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asp lie Val Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Leu Gly 
15 10 15 

Gin Arg Ala Thr lie Ser Cys Arg Ala Ser Glu Ser Val Asp Ser Tyr 
20 25 30 

Gly Asn Ser Phe Met His Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro 
35 40 45 

Lys Leu Leu lie Tyr Arg Ala Ser Asn Leu Glu Ser Gly lie Pro Ala 
50 55 60 

Arg Phe Ser Gly Ser Gly Ser Arg Thr Asp Phe Thr Leu Thr lie Asn 
65 70 75 80 

Pro Val Glu Ala Asp Asp Val Ala Thr Tyr Tyr Cys Gin Gin Ser Asn 
85 90 95 

Glu Asp Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
45 100 105 110 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9201 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



55 
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5 

(ii) MOLECULE TYPE: DNA (genomic) 

10 , . „ 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AAGCTTTCTG GGGCAGGCCA GGCCTGACCT TGGCTTTGGG GCAGGGAGGG GGCTAAGGTG 60 

AGGCAGGTGG CGCCAGCCAG GTGCACACCC AATGCCCATG AGCCCAGACA CTGGACGCTG 120 

15 AACCTCGCGG ACAGTTAAGA AC C C AGGGGC CTCTGCGCCC TGGGCCCAGC TCTGTCCCAC 180 

ACCGCGGTCA CATGGCACCA CCTCTCTTGC AGCCTCCACC AAGGGCCCAT CGGTCTTCCC 240 

CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT GCCTGGTCAA 300 

GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA GGCGCCCTGA CCAGCGGCGT 3 60 

20 

GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC TCCCTCAGCA GCGTGGTGAC 420 

CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC AACGTGAATC ACAAGCCCAG 480 

CAACACCAAG GTGGACAAAC GCGTTGGTGA GAGGCCAGCA CAGGGAGGGA GGGTGTCTGC 540 

25 TGGAAGCCAG GCTCAGCGCT CCTGCCTGGA CGCATCCCGG CTATGCAGCC CCAGTCCAGG 600 

GCAGCAAGGC AGGCCCCGTC TGCCTCTTCA CCCGGAGGCC TCTGCCCGCC CCACTCATGC 660 

TCAGGGAGAG GGTCTTCTGG CTTTTTCCCC AGGCTCTGGG CAGGCACAGG CTAGGTGCCC 7 20 

30 CTAACCCAGG CCCTGCACAC AAAGGGGCAG GTGCTGGGCT CAGACCTGCC AAGAGCCATA 780 

TCCGGGAGGA CCCTGCCCCT GACCTAAGCC CACCCCAAAG GCCAAACTCT CCACTCCCTC 840 

AGCTCGGACA CCTTCTCTCC TCCCAGATTC CAGTAACTCC CAATCTTCTC TCTGCAGAGC 900 

35 CCAAATCTTG TGACAAAACT CACACATGCC CACCGTGCCC AGGTAAGCCA GCCCAGGCCT 960 

CGCCCTCCAG CTCAAGGCGG GACAGGTGCC CTAGAGTAGC CTGCATCCAG GGACAGGCCC 1020 

CAGCCGGGTG CTGACACGTC CACCTCCATC TCTTCCTCAG CACCTGAACT CCTGGGGGGA 1080 

CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 1140 

40 

GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 1200 

TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 1260 

AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 1320 

45 GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 1380 

AAAGCCAAAG GTGGGACCCG TGGGGTGCGA GGGCCACATG GACAGAGGCC GGCTCGGCCC 1440 

ACCCTCTGCC CTGAGAGTGA CCGCTGTACC AACCTCTGTC CCTACAGGGC AGCCCCGAGA 1500 

ACCACAGGTG TACACCCTGC CCCCATCTAG AGAGGAGATG ACCAAGAACC AGGTCAGCCT 1560 

50 

GACCTGCCTG GTCAAAGGCT TCTATCCCAG CGACATCGCC GTGGAGTGGG AGAGCAATGG 1620 

55 
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5 



10 



GCAGCCGGAG 


AACAACTACA 


AGACCACGCC 


CCTCTACAGC 


AAGCTCACCG 


TGGACAAGAG 


CTCCGTGATG 


CATGAGGCTC 


TGCACAACCA 


GGGTAAATGA 


GTGCGACGGC 


CGGCAAGCCC 


GATGCTTGGC 


ACGTACCCCC 


TGTACATACT 


CCAGCGCTGC 


CCTGGGCCCC 


TGCGAGACTG 


CTGAGGCCTG 


AGTGGCATGA 


GGGAGGCAGA 


GCTGTGCAGG 


TGTGCCTGGG 


CCCCCTAGGG 


GGTGGGGGAT 


TTGCCAGCGT 


GGCCCTCCCT 



20 



25 



GGAAGCCCTA 


GGAGCCCCTG 


GGGACAGACA 


TGTTCTGTGA 


GCGCCCCTGT 


CCTCCCGACC 


ATGTGCGTAG 


GGACAGGCCC 




CCCTGCCCAG 


CCTCGCACCC 


GCATGGGGAC 


GCCCTGTGGA 


GGGACTGGTG 


CAGATGCCCA 


AACCCCGCAC 


TGAGGTTGGC 


CGGCCACACG 


CACGGAGCCT 


CACCCGGGCG 


AACTGCACAG 


GTGAACACTC 


CTCGGACACA 


GGCCCCCACG 


CCTCTCGGCA 


GCTTCTCCAC 


ATGCTGACCT 


GGGTGCCCCT 


GCAGCCGCCA 


CACACACACA 


CCTGGCCCAC 


TTCCCAGTGC 


CGCCCTTCCC 


ATTGATGAGT 


TTGGACAAAC 


CACAACTAGA 


ATTTGTGATG 


CTATTGCTTT 


ATTTGTAACC 


AACAATTGCA 


TTCATTTTAT 


GTTTCAGGTT 


AAGTAAAACC 


TCTACAAATG 


TGGTATGGCT 


TCAAATATTC 


CTTATTAACC 


CCTTTACAAA 


CATAGTTATT 


AATAGCAGAC 


ACTCTATGCC 


GATTATAACT 


GTTATGCCTA 


CTTATAAAGG 


ATAGCAGTGC 


AGCTTTTTCC 


TTTGTGGTGT 


TAAACACAGC 


ATGACTCAAA 


AAACTTAGCA 


CCTTTCTCTT 


CTTTTTTGGA 


GGAGTAGAAT 



TCCCGTGCTG 


GACTCCGACG 


GCTCCTTCTT 


1680 


CAGGTGGCAG 


CAGGGGAACG 


TCTTCTCATG 


1740 


CTACACGCAG 


AAGAGCCTCT 


CCCTGTCTCC 


1800 


CCGCTCCCCG 


GGCTCTCGCG 


GTCGCACGAG 


1860 


TCCCGGGCGC 


CCAGCATGGA 


AATAAAGCAC 


1920 


TGATGGTTCT 


TTCCACGGGT 


CAGGCCGAGT 


1980 


GCGGGTCCCA 


CTGTCCCCAC 


ACTGGCCCAG 


2040 


TGGGGCTCAG 


CCAGGGGCTG 


CCCTCGGCAG 


2100 


CCAGCAGCAC 


CTGCCCTGGG 


CTGGGCCACG 


2160 


CACAGCCCCT 


GCCTCTGTAG 


GAGACTGTCC 


2220 


TCCATGCCCA 


CTCGGGGGCA 


TGCCTAGTCC 


2280 


ATCTACCCCC 


ACGGCACTAA 


CCCCTGGCTG 


2340 


ACAACCGACT 


CCGGGGACAT 


GCACTCTCGG 


2400 


CACACACACT 


CAGCCCAGAC 


CCGTTCAACA 


2460 


GCCACCACAC 


ACACACGTGC 


ACGCCTCACA 


2520 


CACCCAGACC 


AGAGCAAGGT 


CCTCGCACAC 


2580 


AGCCCCACGC 


GGCACCTCAA 


GGCCCACGAG 


2640 


GCTCAGACAA 


ACCCAGCCCT 


CCTCTCACAA 


2700 


GGGGATCACA 


CACCACGTCA 


CGTCCCTGGC 


2760 


TGCAGGACGG 


ATCCAGACAT 


GATAAGATAC 


2820 


A 1 LrL-AO 1 bAft 


& X x. A A rnp r«rpm 
AAAAA 1 \jry* 1 L 


TATi TdTCjAA 


2880 


ATTATAAGCT 


GCAATAAACA 


AGTTAACAAC 


2940 


CAGGGGGAGG 


TGTGGGAGGT 


TTTTTAAAGC 


3000 


GATTATGATC 


TCTAGTCAAG 


GCACTATACA 


3060 


TTAAAAAGCT 


AAAGGTACAC 


AATTTTTGAG 


3120 


TGTGTGGAGT 


AAGAAAAAAC 


AGTATGTTAT 


3180 


TTACAGAATA 


TTTTTCCATA 


ATTTTCTTGT 


3240 


AAATAGCAAA 


GCAAGCAAGA 


GTTCTATTAC 


3300 


ATTCTGAAGG 


AAAGTCCTTG 


GGGTCTTCTA 


3360 


GTTGAGAGTC 


AGCAGTAGCC 


TCATCATCAC 


3420 



55 
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5 

TAGATGGCAT TTCTTCTGAG CAAAACAGGT TTTCCTCATT AAAGGCATTC CACCACTGCT 3480 

CCCATTCATC AGTTCCATAG GTTGGAATCT AAAATACACA AACAATTAGA ATCAGTAGTT 3540 

10 

TAACACATTA TACACTTAAA AATTTTATAT TTACCTTAGA GCTTTAAATC TCTGTAGGTA 3600 

GTTTGTCCAA TTATGTCACA CCACAGAAGT AAGGTTCCTT CACAAAGATC C GGG ACC AAA 3 660 

GCGGCCATCG TGCCTCCCCA CTCCTGCAGT TCGGGGGCAT GGATGCGCGG ATAGCCGCTG 3720 

15 CTGGTTTCCT GGATGCCGAC GGATTTGCAC TGCCGGTAGA ACTCCGCGAG GTCGTCCAGC 3780 

CTCAGGCAGC AGCTGAACCA ACTCGCGAGG GGATCGAGCC CGGGGTGGGC GAAGAACTCC 3840 

AGCATGAGAT CCCCGCGCTG G AGGATC AT C CAGCCGGCGT CCCGGAAAAC GATTCCGAAG 3900 

CCCAACCTTT CATAGAAGGC GGCGGTGGAA TCGAAATCTC GTGATGGCAG GTTGGGCGTC 3960 

20 

GCTTGGTCGG TCATTTCGAA CCCCAGAGTC CCGCTCAGAA GAACTCGTCA AGAAGGCGAT 4020 

AGAAGGCGAT GCGCTGCGAA TCGGGAGCGG CGATACCGTA AAGCACGAGG AAGCGGTCAG 4080 

CCCATTCGCC GCCAAGCTCT TCAGCAATAT CACGGGTAGC CAACGCTATG TCCTGATAGC 4140 

25 

GGTCCGCCAC ACCCAGCCGG CCACAGTCGA TGAATCCAGA AAAGCGGC C A TTTTCCACCA 4200 

TGATATTCGG CAAGCAGGCA TCGCCATGGG TCACGACGAG ATCCTCGCCG TCGGGCATGC 4260 

GCGC CTTGAG CCTGGCGAAC AGTTCGGCTG GCGCGAGCCC CTGATGCTCT TCGTCCAGAT 4 3 20 

30 CATCCTGATC GACAAGACCG GCTTCCATCC GAGTACGTGC TCGCTCGATG CGATGTTTCG 4380 

CTTGGTGGTC GAATGGGCAG GTAGCCGGAT CAAGCGTATG CAGCCGCCGC ATTGCATCAG 4440 

CCATGATGGA TACTTTCTCG GCAGGAGCAA GGTGAGATGA CAGGAGATCC TGCCCCGGCA 4500 

35 CTTCGCCCAA TAGCAGCCAG TCCCTTCCCG CTTCAGTGAC AACGTCGAGC ACAGCTGCGC 4560 

AAGGAACGCC CGTCGTGGCC AGCCACGATA GCCGCGCTGC CTCGTCCTGC AGTTCATTCA 4620 

GGGCACCGGA CAGGTCGGTC TTGACAAAAA GAACC GGGCG CCCCTGCGCT GACAGCCGGA 4680 

ACACGGCGGC ATCAGAGCAG CCGATTGTCT GTTGTGCCCA GTCATAGCCG AATAGCCTCT 4740 

40 

CCACCCAAGC GGCCGGAGAA CCTGCGTGCA ATCCATCTTG TTCAATCATG CGAAACGATC 4800 

CTCATCCTGT CTCTTGATCA GATCTTGATC CCCTGCGCCA TCAGATCCTT GGCGGCAAGA 4860 

AAGCCATCCA GTTTACTTTG CAGGGCTTCC CAACCTTACC AG AGGGC GCC CCAGCTGGCA 4920 

45 ATTCCGGTTC GCTTGCTGTC CATAAAACCG CCCAGTCTAG CTATCGCCAT GTAAGCCCAC 4 980 

TGCAAGCTAC CTGCTTTCTC TTTGCGCTTG CGTTTTCCCT TGTCCAGATA GCCCAGTAGC 5040 

TGACATTCAT CCGGGGTCAG CACCGTTTCT GCGGACTGGC TTTCTACGTG TTCCGCTTCC 5100 

TTTAGCAGCC CTTGCGCCCT GAGTGCTTGC GGCAGCGTGA AGC TAGCTTT TTGCAAAAGC 5160 

50 

CTAGGCCTCC AAAAAAGCCT CCTCACTACT TCTGGAATAG CTCAGAGGCC GAGGCGGCCT 5220 

55 
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CGGCCTCTGC ATAAATAAAA AAAATTAGTC 
GCGGAGTTAG GGCGGAACTG GGCGGAGTTA 
GATGCATGCT TTGCATACTT CTGCCTGCTG 
CTGACTAATT GAGATGCATG CTTTGCATAC 
CACACCCTAA CTGACACACA TTCCACAGCT 
15 ACCTCTGACA CATGCAGCTC CCGGAGACGG 

GCAGACAAGC CCGTCAGGGC GCGTCAGCGG 
CCCAGTCACG T AGCG AT AG C GGAGTGTATA 
TGTACTGAGA GTGCACCATA TGCGGTGTGA 

20 



25 



50 



CCGCATCAGG 


CGCTCTTCCG 


CTTCCTCGCT 


GCGGCGAGCG 


GTATCAGCTC 


ACTCAAAGGC 


TAACGCAGGA 


AAGAACATGT 


GAGCAAAAGG 


CGCGTTGCTG 


GCGTTTTTCC 


ATAGGCTCCG 


CTCAAGTCAG 


AGGTGGCGAA 


ACCCGACAGG 


AAGCTCCCTC 


GTGCGCTCTC 


CTGTTCCGAC 


TCTCCCTTCG 


GGAAGCGTGG 


CGCTTTCTCA 


GTAGGTCGTT 


CGCTCCAAGC 


TGGGCTGTGT 


CGCCTTATCC 


GGTAACTATC 


GTCTTGAGTC 


GGCAGCAGCC 


ACTGGTAACA 


GGATTAGCAG 


CTTGAAGTGG 


TGGCCTAACT 


ACGGCTACAC 


GCTGAAGCCA 


GTTACCTTCG 


GAAAAAGAGT 


CGCTGGTAGC 


GGTGGTTTTT 


TTGTTTGCAA 


T C AAGAAG AT 


CCTTTGATCT 


TTTCTACGGG 


TTAAGGGATT 


TTGGTCATGA 


GATTATCAAA 


AAAATGAAGT 


TTTAAATCAA 


TCTAAAGTAT 


ATGCTTAATC 


AGTGAGGCAC 


CTATCTCAGC 


CTGACTCCCC 


GTCGTGTAGA 


TAACTACGAT 


TGCAATGATA 


CCGCGAGACC 


CACGCTCACC 


AGCCGGAAGG 


GCCGAGCGCA 


GAAGTGGTCC 


TAATTGTTGC 


CGGGAAGCTA 


GAGTAAGTAG 



AGCCATGGGG CGGAGAATGG GGC GGGATGG 5280 

GGGGCGGGAC TATGGTTGCT GACTAATTGA 5340 

GGGAGCCTGG GGACTTTCCA CACCTGGTTG 5400 

TTCTGCCTGC TGGGGAGCCT GGGGACTTTC 5460 

GCCTCGCGCG TTTCGGTGAT GACGGTGAAA 5520 

TCACAGCTTG TCTGTAAGCG GATGC CGGG A 5580 

GTGTTGGCGG GTGTCGGGGC GCAGCCATGA 5640 

CTGGCTTAAC TATGCGGCAT CAGAGCAGAT 5700 

AATACCGCAC AG AT GCGT AA GGAGAAAATA 5760 

CACTGACTCG CTGCGCTCGG TCGTTCGGCT 5820 

GGTAATACGG TT AT C C AC AG AATCAGGGGA 5880 

CCAGCAAAAG GCCAGGAACC GTAAAAAGGC 5940 

CCCCCCTGAC GAGCATCACA AAAATCGACG 6000 

ACTATAAAGA TACCAGGCGT TTCCCCCTGG 6060 

CCTGCCGCTT ACCGGATACC TGTCCGCCTT 6120 

TAGCTCACGC TGTAGGTATC TCAGTTCGGT 6180 

GCACGAACCC CCCGTTCAGC CCGACCGCTG 6240 

CAACCCGGTA AGACACGACT TATCGCCACT 6 300 

AGCGAGGTAT GTAGGCGGTG CTACAGAGTT 6360 

TAGAAGGACA GTATTTGGTA TCTGCGCTCT 6420 

TGGTAGCTCT TGATCCGGCA AACAAACCAC 6480 

GCAGCAGATT ACGCGCAGAA AAAAAGGATC 6540 

GTCTGACGCT CAGTGGAACG AAAACTCACG 6600 

AAGGATCTTC ACCTAGATCC TTTTAAATTA 6660 

ATATGAGTAA ACTTGGTCTG ACAGTTACCA 6720 

GATCTGTCTA TTTCGTTCAT CCATAGTTGC 6780 

ACGGGAGGGC TTACCATCTG GCCCCAGTGC 6840 

GGCTCCAGAT TTATCAGCAA TAAACCAGCC 6900 

TGCAACTTTA TCCGCCTCCA TCCAGTCTAT 6960 

TTCGCCAGTT AATAGTTTGC GCAACGTTGT 70 20 
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20 



25 



TGCCATTGCT GCAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC 7080 

CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG 7140 

CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT 7200 

TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC 7 260 

TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG 7320 

15 CCCGGCGTCA ACACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT 7 380 

TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC 7440 

GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC 7500 

TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA 7560 

ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG 7620 

TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG 7680 

CACATTTCCC CGAAAAGTGC CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC 7740 

CTATAAAAAT AGGCGTATCA CGAGGCCCTT TCGTCTTCAA GAATTCATCG AT ATC GG AAA 7800 

ATGAAAAAAA ATATTTTTTA ATTTTAAAAT GAAATGTTTA TTTTCAATTT CTCCAAATTT 7860 

CACAAGGAAA GATTAGTCAC GGGTATGGGA GAGCAGAGGA CCATAAGAGT TCAGGAATAG 7920 

30 AATC C ATT AT GATTCTGGAG TCAAGGAAGT ACTGATGCCA AGGTTTCAGT ATAAGAGCAG 7980 

TATCCACTGG AAAGGATAAA GTCACTACAA CTGAGCACAG AGCAGGACAG CTACCTAATG 8040 

AGTGGTCACT AATGGGCCAC TGTTACACTG TTATACGGCT TAGGAATGAG CACTGAGGCT 8100 

GTGAGGTGTA TGGGTAAGGA CATCAGGATG TAAACCCAGC T C AGGT AG AG GACTCAGAGC 8160 

ACAGCACAAT CAGCACGAAC TAATAAACAA CAGATAAGAT AAGGCACAAG CTCAGCAATA 8220 

TTGGATCAGG GATCTTTGTA AATCTGACTG TGTATTCAGT CTAGTTCAAT GTGACTCATG 8280 

AAGCCCACCC ATATGCAAAT CTAGAGAAGA CTTTAGAGTA TAAATCTGAG GCTCACCTCA 8340 

CATACCAGCA AGGGAGTGAC CAGCTTGTCT TAAGGCACCA CTGAGCCCAA GTCTTAGACA 8400 

TCATGGATTG GCTGTGGAAC TTGCTATTCC TGATGGCAGC TGCCCAAGGT AAGTCATCAG 8460 

AAAAAAGAGT TCCAAGGGAA ATTGAAGCAG TTCCGAGCTC GGTACCCTCG AGATCCTAGA 8520 

45 GAGGTCTGGT GGAGCCTGCA AAAGTCCAGC TTTCAAAGGA ACACAGAAGT ATGTGTATGG 8580 

AATATTAGAA GATGTTGCTT TTACTCTTAA GTTGGTTCCT AGGAAAAATA GTTAAATACT 8640 

GTGACTTTAA AATGTGAGAG GGTTTTCAAG TACTCATTTT TTTAAATGTC CAAAATTTTT 8700 

GTCAATCAAT TTGAGGTCTT GTTTGTGTAG AACTGACATT ACTTAAAGTT TAACCGAGGA 8760 

ATGGGAGTGA GGCTCTCTCA TACCCTATCC AGAACTGACT TTTAACAATA ATAAATTAAG 8820 



35 



40 



50 
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20 



TTTAAAATAT TTTTAAATGA AT TGAGC AAT GTTGAGTTGA 
CCAGAACACC TGCAGCAGCT GGCAGGAAGC AGGTCATGTG 
GAAAATAAAA CCACTAGGTA AACTTGTAGC TGTGGTTTGA 
CTGTCCAGCC CCACCAAACC GAAAGTCCAG GCTGAGCAAA 
ATTTCTAAAA TAAGTTGAGG ATTCAGCCGA AACTGGAGAG 
GTTCAACCTT TTAATTTTAG CTTGAGTAGT TCTAGTTTCC 
TTCTAAAATG TATTTAGAAT T 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



GTCAAGATGG 
GCAAGGCTAT 
AGAAGTGGTT 
ACACCACCTG 
GTCCTCTTTT 
CCAAACTTAA 



CCGATCAGAA 
TTGGGGAAGG 
TTGAAACACT 
GGTAATTTGC 
AACTTATTGA 
GTTTATCGAC 



8880 
8940 
9000 
9060 
9120 
9180 
9201 



25 



30 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGATCCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA GAATGCAGTG 60 

AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA CCATTATAAG 120 

CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG TTCAGGGGGA 180 

GGTGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTATGG CTGATTATGA 240 

TCTCTAGTCA AGGCACTATA CATCAAATAT TCCTTATTAA CCCCTTTACA AATTAAAAAG 300 

CTAAAGGTAC ACAATTTTTG AGCATAGTTA TTAATAGCAG ACACTCTATG CCTGTGTGGA 360 

GTAAGAAAAA ACAGTATGTT AT G ATT AT AA CTGTTATGCC T AC TT AT AAA GGTTACAGAA 420 

TATTTTTCCA TAATTTTCTT GTATAGCAGT GCAGCTTTTT CCTTTGTGGT GTAAATAGCA 480 

AAGCAAGCAA GAGTTCTATT ACTAAACACA GCATGACTCA AAAAACTTAG CAATTCTGAA 540 

GGAAAGTCCT TGGGGTCTTC TACCTTTCTC TTCTTTTTTG GAGGAGTAGA ATGTTGAGAG 600 

TCAGCAGTAG CCTCATCATC ACTAGATGGC ATTTCTTCTG AGCAAAACAG GTTTTCCTCA 660 

TTAAAGGCAT TCCACCACTG CTCCCATTCA TCAGTTCCAT AGGTTGGAAT CTAAAATACA 7 20 

CAAACAATTA G AATC AG TAG TTTAACACAT TATACACTTA AAAATTTTAT ATTTACCTTA 780 

GAGCTTTAAA TCTCTGTAGG TAGTTTGTCC AATTATGTCA CACCACAGAA GTAAGGTTCC 840 

TTCACAAAGA TCCGGGGCCC ACTCATAAAT CCAGTTGCCG CCACGGTAGC CAATCACCGT 900 

ATCGTATAAA TCATCGTCGG TACGTTCGGC ATCGCTCATC ACAATACGTG CCTGGACGTC 960 
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GAGGATTTCG CGTGGGTCAA TGCCGCGCCA GATCCACATC AGACGGTTAA TCATGCGATA 1020 

CCAGTGAGGG ATGGTTTTAC CATCAAGGGC CGACTGCACA GGCGGTTGTG CGCCGTGATT 1080 

10 

AAAGCGGCGG ACTAGCGTCG AGGTTTCAGG ATGTTTAAAG CGGGGTTTGA ACAGGGTTTC 1140 

GCTCAGGTTT GCCTGTGTCA TGGATGCAGC CTCCAGAATA CTTACTGGAA ACTATTGTAA 1200 

CCCGCCTGAA GTTAAAAAGA ACAACGCCCG GCAGTGCCAG GCGTTGAAAA GATTAGCGAC 1260 

15 CGGAGATTGG CGGGACGAAT ACGACGCCCA TATCCCACGG CTGTTCAATC CAGGTATCTT 13 20 

GCGGGATATC AACAACATAG TCATCAACCA GCGGACGACC AGCCGGTTTT GCGAAGATGG 1380 

TGACAAAGTG CGCTTTTGGA TACATTTCAC GAATCGCAAC CGCAGTACCA CCGGTATCCA 14 40 

CCAGGTCATC AATAACGATG AAGCCTTCGC CATCGCCTTC TGCGCGTTTC AGCACTTTAA 1500 

20 

GCTCGCGCTG GTTGTCGTGA TCGTAGCTGG AAATACAAAC GGTATCGACA TGACGAATAC 1560 

CCAGTTCACG CGCCAGTAAC GCACCCGGTA CCAGACCGCC ACGGCTTACG GCAATAATGC 1620 

CTTTCCATTG TTCAGAAGGC ATCAGTCGGC TTGCGAGTTT ACGTGCATGG ATCTGCAACA 1680 

TGTCCCAGGT GACGATGTAT TTTTCGCTCA TGTGAAGTGT CCCAGCCTGT TTATCTACGG 1740 

CTTAAAAAGT GTTCGAGGGG AAAATAGGTT GCGCGAGATT ATAGAGATCA GCTTTTTGCA 1800 

AAAGCCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG AGGCCGAGGC 1860 

30 GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGGGGCGGAG AATGGGGCGG 1920 

GATGGGCGGA GTTAGGGCGG AACTGGGCGG AGTTAGGGGC GGGACTATGG TTGCTGACTA 1980 

ATTGAGATGC TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG 2040 

CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG 2100 

35 

GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 2160 

AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 2220 

GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG 2280 

40 

AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC 2340 

GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG 2400 

GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT 2460 

45 CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC 2520 

GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC 2580 
ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG 2640 
TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA 2700 

50 

GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC 2760 

55 
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GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT 2820 

CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT 2880 

10 

TTGGTCATGA G ATT AT C AAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT 2 940 

TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC 3000 

AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC 3060 

15 GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA 3120 

CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG 3180 

GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC 3 240 

CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT 3 300 

20 

ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA 3 360 

CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT 3 420 

CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA 3480 

25 

CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 3540 

TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA 3600 

AT AC GGGAT A ATACCGCGCC ACATAGCAGA AC TTT AAAAG TGCTCATCAT TGGAAAACGT 3660 

30 TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC 3 720 

ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA 3780 

AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA 3840 

35 CTCATACTCT TCCTTTTTCA AT ATT AT TG A AGCATTTATC AGGGTTATTG TCTCATGAGC 3900 

GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 3960 

CGAAAAGTGC CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT 4020 

AGGCGTATCA CGAGGCCCTT TCGTCTCGCG CGTTTCGGTG ATGACGGTGA AAACCTCTGA 4080 

40 

CACATGCAGC TCCCGGAGAC GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA 4140 

GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA 4200 

TCAGAGCAGA TTGTACTGAG AGTGCACCAT AGCGGCCGCA TATGCGGTGT GAAATACCGC 4260 

45 ACAGATGCGT AAGGAGAAAA TACCGCATCA GGCGCCATTC GCCATTCAGG CTGCGCAACT 4320 

GTTGGGAAGG GCGATCGGTG CGGGCCTCTT CGCTATTACG CC AGAATTCG GCCCAGGGGA 4380 

CTGTGAGGAC AGAAGGCTTG TGGGTTTGAG GGAGGACTGT CTTGCAGAGG ATGATAGGGT 4440 

AAAATAGAAT GAAGGATGAT TTTTATAAAT GGTTACGTGC CTTAGGATGA CTACATATTT 4500 

50 

AGTCCCTTAT AAGAGAAATT GAGTAGTTGG TAAAACAACA GATAATAATT ATTAAATGAG 4560 

55 
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GAAAGAGAGA AACCACAGGT GCAAAGATTC ACTTTATTTA TTCATTCTCC TCCAACATTA 4620 

GCATAATTAA AGCCAAGGAG GAGGAGGGGG GTGAGGTGAA AG ATGAGC TG GAGGACCGCA 4680 

10 

ATAGGGGTAG GTCCCCTGTG GAAAAAGGGT CAGAGGCCAA AGGATGGGAG GGGGTCAGGC 4740 

TGGAACTGAG GAGCAGGTGG GGGCACTTCT CCCTCTAACA CTCTCCCCTG TTGAAGCTCT 4800 

TTGTGACGGG CGAGCTCAGG CCCTGATGGG TGACTTCGCA GGCGTAGACT TTGTGTTTCT 4860 

15 CGTAGTCTGC TTTGCTCAGC GTCAGGGTGC TGCTGAGGCT GTAGGTGCTG TCCTTGCTCT 49 20 

CCTGCTCTGT GACACTCTCC TGGGAGTTAC CCGATTGGAG GGCGTTATCC ACCTTCCACT 4980 

GTACTTTGGC CTCTCTGGGA TAGAAGTTAT TCAGCAGGCA CACAACAGAG GCAGTTCCAG 5040 

ATTTCAACTG CTCATCAGAT GGCGGGAAGA TGAAGACAGA TGGTGCAGCC ACAGTTCCTG 5100 

20 

AGGAAAGAAG CAAACAGGAT GGTGTTTAAG TAACAAAGTT CTGCCCTTGG GTGTGTTGTT 5160 

TGCGGATAAG GGCATGTTAG GGACAGACAG AAAACAGCAT GCTTATCCCA GATAATTATA 52 20 

GCAAGGAGAC CAAGAAGCGT ATTTAAAATC TTGATGTTTT GAGTTTCTTC CTAGCTTCCC 5280 

CCTATTCCTT AATAAAGTTC TAAATTGTTT TGTTGGAGCT CTTTGCAGCC ATTCTGAGGG 5340 

CTTTGCATGC TTTTCTGACC TTGCAGTAAA CTCAATGCTT TAGGCAAAGA ATGGCCACGT 5400 

CATCCGACCC CCTCAGAGTT TAGAATTCAT CGATATCTAG ATCCTAGATA ATTGCATTCA 5460 

30 TTTAAAAAAA AAATATTTCT CCTAAAATGA ATACTCAGAA AGTGGTCTTG AAAAAGATTT 5520 

GTGAAGCCGT TTTGACCAGA ATGTCAAAGT CTTAATAGTA AGGCAAAACA AACAACTAAA 5580 

AAAGATCATG AACAAAGTCA CTGTAAAGAC TTCGGGTATT GGAAAATAAT TGAATGGAGA 5640 

35 CCAATAATCA GAGGGAAGAA TAATAGAGTA ATTTTAAGAA GTTTTCTAAA TATATTAGAA 5700 

ATTAAAGACA CTAAAGTCCT TCAATTTCTT ACATAACCTA ATTTTGAAAA TGAATTCTAA 5760 

ATACATTTTA GAAGTCGATA AACTTAAGTT TGGGGAAACT AGAACTACTC AAGCTAAAAT 5820 

TAAAAGGTTG AACTCAATAA GTTAAAAGAG GACCTCTCCA GTTTCGGCTG AATCCTCAAC 5880 

40 

TTATTTTAGA AATGCAAATT ACCCAGGTGG TGT TTTGCTC AGCCTGGACT TTCGGTTTGG 5940 

TGGGGCTGGA CAGAGTGTTT CAAAACCACT TCTTCAAACC ACAGCTACAA GTTTACCTAG 6000 

TGGTTTTATT TTCCCTTCCC CAAATAGCCT TGCCACATGA CCTGCTTCCT GCCAGCTGCT 6060 

45 GCAGGTGTTC TGGTTCTGAT CGGCCATCTT GACTCAACTC AACATTGCTC AATTCATTTA 6120 

AAAATATTTT AAACTTAATT TATTATTGTT AAAAGTCAGT TCTGGATAGG GTATGAGAGA 6180 

GCCTCACTCC CATTCCTCGG TTAAACTTTA AGTAATGTCA GTTCTACACA AACAAGACCT 6240 

5o CAAATTGATT GACAAAAATT TTGGACATTT AAAAAAATGA GT AC TTG AAA ACCCTCTCAC 6300 

ATTTTAAAGT CACAGTATTT AACTATTTTT CCTAGGAACC AACTTAAGAG TAAAAGCAAC 6360 

55 
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ATCTTCTAAT 


ATT C CAT AC A 


CATACTTCTG 


TGTTCCTTTG 


AAAGCTGGAC 


TTTTGCAGGC 


6420 


TCCACCAGAC 


CTCTCTAGGA 


TCTCGAGCTC 


GCGAAAGCTT 


GCATGCCTGC 


AGGTCGACTC 


6480 


TAGAGGATCA 


AACCAACTGT 


CTTTGAGTAG 


AGCCAAAATT 


GTTGATATAC 


TTTGAATTTT 


6540 


AATTATATTT 


CTTGCTGAGC 


AGAGGTGGCA 


AGAGTTTTCA 


CTAATGTGCA 


AAACCACCTC 


6600 


ATGTTCCCCT 


CACCTGGGAG 


CCAGAGTAGC 


AGGAGGAAGA 


GAAGCTGAGC 


TGGGGCTTCC 


6660 


ATGGTTCCCT 


CTGGGTC CT A 


ACTGAGCAGT 


TCCTCCCCAG 


GGCTCTGACA 


CAGGCATTGA 


6720 


TATGGGCTCT 


GGAAGGTAGG 


GCAGCTGGGA 


GGGACATGCA 


AAGCAGCTGG 


GTGGGAGCTG 


6780 


AGCTTCCAGC 


TGCAGAGACC 


ACCTGCTTCT 


TCCTCTCTGC 


ACTGAGCATC 


CTGCGCCACC 


6840 


CTGGTTGTCA 


GGCCAGAAAA 


GTCTGTTGGC 


TCAGTCTGAG 


TGTAGAACTT 


CTCCCTTGTG 


6900 


CTCAGAGAAT 


TTCATTCCTA 


TGTCTTTCTT 


CTCCTCAATC 


ACCTAAATTC 


ACCCAGATGA 


6960 


TGTTTGGCAC 


AAGCCTGTTA 


AGAACAATAT 


AAAAGGCTGT 


GTTTTCATTT 


CTCTCTTCCT 


7020 


ATCCTCAATA 


TGCCCAGTCA 


TCTCCCTAAG 


TGCATTATT 






7059 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AAAAAAGAAT TCGAGCTCTT C TGAT AACGC TGTCCTTCTG TTTGCAGGTG TCCAGTGTCA 

GGTCCAACTT GTCCAGTCCG G 

(2) INFORMATION FOR SEQ ID NO: 12: 

40 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTAACTTCGG CACCGGACTG GACAAGTTGG ACCTGACACT GGACACCTGC AAACAGAAGG 60 

50 

ACAGCGTTAT CAGAAGAGCT CGAATTCTTT TTT 93 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGCCGAAGTT AAGAAGCCTG GCGCTTCTGT GAAGGTCTCC TGCAAGGCTT CTGGCTACAC 
CTTCACCGAC TACTGGATGA ACTGGGTTCG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCAGGTGCCT GTCGAACCCA GTTCATCCAG TAGTCGGTGA AGGTGTAGCC AGAAGCCTTG 
CAGGAGACCT TCACAGAAGC GCCAGGCTTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ACAGGCACCT GGACAGGGCC TAGAGTGGAT GGGAAGGATT GATCCTTCCG ATAGTGAAAC 
TCACTACAAT CAGAAGTTCC AGGGTAGGGT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGGTCATTG TTACCCTACC CTGGAACTTC TGATTGTAGT GAGTTTCACT ATCGGAAGGA 60 
15 TCAATCCTTC CCATCCACTC TAGGCCCTGT 90 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

25 Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

AACAATGACC C GAG AC AC AT CCACCAGCAC AGTCTACATG GAACTCAGCA GCCTGCGATC 60 
TGAGGACACC GCAGTCTATT ACTGTGCACG 90 

30 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
40 AGCCGTCCAC CTCGTGCACA GTAATAGACT GCGGTGTCCT CAGATCGCAG GCTGCTGAGT 60 
TCCATGTAGA CTGTGCTGGT GGATGTGTCT 90 
(2) INFORMATION FOR SEQ ID NO: 19: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGGTGGACGG CTCGGTTCCT TTGCTATGGA CTACTGGGGT CAAGGCACCC TCGTCACCGT 60 

CTCCTCAGGT GAGTCCTCAC ACTCGAGGTC GAC 93 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
AAAAAAGTCG ACCTCGAGTG TGAGGACTCA CCTGAGGAGA CGGTGACGAG GGTGCCTTGA 60 
CCCCAGTAGT CCATAGCAAA GGAACCG 87 
(2) INFORMATION FOR SEQ ID NO: 21: 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTTTTTGAAT TCAAGCTTTC CTGACTACAT GAGTGCATTT CTGTTTTATT TCCAATTTCA 60 
GATACCACCG GAGAAAT TGT GCTAACACAA 90 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
5o AATTTCTCCG GTGGTATCTG AAATTGGAAA TAAAACAGAA ATGCACTCAT GTAGTCAGGA 60 

AAGCTTGAAT TCAAAAAA 78 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCTCCAGCTA CATTGTCTTT GTCTCCAGGT GAGAGAGCCA CTCTATCCTG CAGAGCCAGT 
GAAAGTGTTG ATAGTTATGG CAATAGT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ACTATCAACA CTTTCACTGG CTCTGCAGGA TAGAGTGGCT CTCTCACCTG GAGACAAAGA 
CAATGTAGCT GGAGATTGTG TTAGCAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TTTATGCACT GGTACCAGCA GAAACCAGGA CAGGCACCAA GGCTCCTCAT CTATCGTGCA 
TCCAACCTAG AAACTGGTAT CCCTGCC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE; nucleic acid 
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(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AGTTTCTAGG TTGGATGCAC GATAGATGAG GAGCCTTGGT GCCTGTCCTG GTTTCTGCTG 
GTACCAGTGC ATAAAACTAT TGCCATA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGGTTCAGTG GCAGTGGTTC TAGGACAGAC TTCACTCTCA CCTATTCTTC TCTAGAGCCT 
GAAGATTTTG CAGTGTATTA CTGTCAG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CACTGCAAAA TCTTCAGGCT CTAGAGAAGA ATAGGTGAGA GTGAAGTCTG TCCTAGAACC 
ACTGCCACTG AACCTGGCAG GGATACC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CAAAGTAATG AGGATCCTCG GACGTTCGGT GGAGGCACCA AGGTGGAAGA GAAACGTAAG 60 

TGCACTTTCC TCGAGGTCGA CTTTTTT 87 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAAAAAGTCG ACCTCGAGGA AAGTGCACTT ACGTTTCTCT TCCACCTTGG TGCCTCCACC 60 
GAACGTCCGA GGATCCTCAT TACTTTGCTG ACAGTAATA 99 
(2) INFORMATION FOR SEQ ID NO: 31: 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AAAAAAGAAT TCGAGCTCTT TTCTGATAAC GTTGTCCTTC TGTTTCTTGC AGGTGTCCAG 60 
TGTCAGGTCC AACTTCAGCA GCCTGGG 87 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI-SENSE: YES 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AAAAAAGTCG ACTGTGAGGA CTCACCTGAG GAGACGGTGA CTGAGGTGCC T 51 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AAAAAAGAAT TCAAGCTTTC CTGACTACAT GAGTGCATTT CTGTTTTATT TCCAATTTCA 
20 GATACCACCG GAGACATTGT GCTAACACAA TCTCCA 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE; YES 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AAAAAAGTCG ACCTCGAGAT CACTTACGTT TGATTTCCAG CTTGGTGCCT CCAC 
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ims 

A method for producing a humanized monoclonal antibody by utilizing a process of comparative model 
building comprising: 

a) selecting a monoclonal antibody to be humanized; 

b) searching computer databanks for protein crystal structures that demonstrate greater than 50 per- 
cent sequence homology to the variable region of said antibody to produce a structural template; 

c) determining the structure of the complementarity determining region, or CDR, loops and assigning 
the loops to canonical loop conformations; 

d) determining the framework residues which are crucial to the conformation of the CDR loops; 

e) replacing the CDR loops of the structural templates with canonical CDR backbone templates using 
interactive computer graphics; 

f) searching computer databanks to extract initial backbone approximations for each loop for non-can- 
onical CDR loops; 

g) replacing all non-conserved amino-acid side chains in similar positions on said antibody and on the 
computer model with human amino acid residues using interactive computer graphics to produce a mod- 
el having a combination of backbone fragments of different antibodies with replaced side chains; 

h) solvating the models with a water layer corresponding to about 7 angstroms; 

i) refining the structure with an energy minimization protocol to produce a structure wherein all atoms 
of the system are freely mobile; 

j) searching computer databanks to find homologous human sequences for the variable light and va- 
riable heavy chains of the antibody; 

k) combining the sequences found in (j) to obtain human templates; 

I) comparing the structural template of (a) with the human templates of (k) and selecting a human template 
with variable regions having greater than 50 percent sequence identity with the structural template; 
m) determining the CDR loops of the human template selected in (I); 

n) replacing the CDR loop region of the selected human template with the analogous sequences from 
the antibody to produce a Phase 1 humanized sequence; 

o) superimposing the models of the antibody and the Phase 1 humanized sequence to compare the 
binding site regions; 

p) identifying by the comparison in (o) all amino acids in the framework residues and CDR junction residues that 
interact with the antibody CDR loops that can be important to the structural integrity of the antibody binding site; 
q) reinserting into the Phase 1 humanized sequence all amino acid residues identified in (p) to be dif- 
ferent from those in the antibody, and refining the resultant structure with an energy minimization pro- 
tocol to produce a Phase II humanized sequence; 

r) refining the Phase II humanized sequence using iterative conformational search protocols on all re- 
gions of the binding site and by analysis of the binding site to determine which regions of the CDR sur- 
face or residues at the CDR -framework junction are not likely to involve antigen binding; and 
s) replacing the amino acids in the non-antigen binding regions of the binding site with amino acid re- 
sidues corresponding to the human residues to produce a humanized monoclonal antibody. 

The method of Claim 1 wherein the monoclonal antibody is a murine antibody. 

The method of Claim 2 wherein the monoclonal antibody is an anti-CD18 monoclonal antibody. 

The method of Claim 3 wherein the monoclonal antibody is 60.3 

A humanized monoclonal antibody having the structural and binding characteristics of the anti- CD18 
monoclonal antibody 60.3 

The humanized monoclonal antibody of Claim 5 wherein the amino acid at position 50 in Figure 2 is 
changed from Arg to Asp. 

The humanized monoclonal antibody of Claim 6, wherein the amino acid at position 54 is changed from 
Leu to Arg and the amino acid at position 55 is changed from Glu to Ala. 

The humanized monoclonal antibody of Claim 7 wherein the amino acid at position 68 is changed from Arg to Gly. 
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60.3 Heavy Chain Sequences 



Kabul hVhl/ 21*2 H60J1 h6C.3 MO J H60.3 

* JM 'CL aamplata PhaaalV PhaaaUTItl Phaaal me03 



1 


















2 




j Va? 


:: :':;VaJ 
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: V 'Val 

; ; :-: ; .Gin : I. 


Vai . 1| 
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3 




^ Gin - : : 


-Gin •■ 
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I Lau ; 
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" . . v Lau - 
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6 
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Pro 
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| Aap 
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Lau 
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lillPra:; 
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16 
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17 
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20 




v- • VaJ : 




;;^?|vai' 


ii^l^i^^iVaj;:'::^! 


••: ; t -Vai 


:<: : wVaJ : ;'-s^ 


Lau 


21 






Sar 


lllliar^; 


||||;;|sar;:"|f 


'•••Sar . :\ 




;;;:r';^Br:;vt'; 
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60.3 Heavy Chain Sequences 



hVW/ 21-2 H60.3 H60.3 H6CJ H50.3 
Jh4 -CL template PhaM IV PhaaaU/HI Phaaal 



m60.3 



45 


i< ■■ 
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60.3 Heavy Chain Sequences 



Kabat hVhl/ 21-3 H60.3 h60J h60* h60.3 
« Jh4 *CL template Phaaa IV Phase U/lII Phaaal m60.3 



HV3 



FR4 




FR4 



* The H60.3 heavy chain was made with the toque nee shown in the 
Phase Mil column, with the following exception: 
H71: Arg instead of Val 



FIGURE 1 3/3 
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60.3 Light Chain Sequences 



Kabat hVkllU l*0.3 H60.3 h60.3 H60.3 
# Jk templata Phaaa IV Phaaa 11/UI PhiMl m60.3 



FR1 



HV1 



FR2 
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36 I 
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60.3 Light Chain Sequences 



Kftbat hVUIV h60J hG0.3 b6C3 K60.3 
# Jk template PhaaalV Phaaa Mil Phasat mSCA 
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60.3 Light Chain Sequences 



Kabat hVkltV h60J N60.3 H60.3 hSO.3 
# Jk template PhaaalV Phase Mil Phaaal ro60.3 



Asp ASD A«P : 

Pbe .-Pfcell Val 

• >u - • Ai* r t9mm': 

Tyr \ Tyr Tyr 

;Cys Cys Cys 

£&n Gin G(n 

CUn • . Gin Gin 

£er &sr .6er 

Asn Asn . Asn 

HV3 93 Asn §^ : m* Qiu Giu QJu 

:;:Asp Mp Asp 

Pro Pro Pro 

. Arg - Arg 



FR4 



82 




Wpsp 




83 




..Phe... 


li^Pha; 


84 








85 


iiivai:^!; 


.• w : : ' 




86 






11M 


87 
88 


.•"Tyr • 
Cys 


■: : i:::': : ::;Tyr 
Cys 


•• :SnTyr ;. 


89 




GJn 




90 




. .yto . , 


1 : Gin 


91 




A* 


>• Ser 


92 




Ser 




93 




Asn 


94 






|; : -"A$p ; 


95 




Pro 


llfPml: 


96 


Trp r Tyr, 




If Arg ' 




Phe, Leu. He 




97 


Thr 




Thr 


98 


llf^':l| : 




^.-•• ;: -*Pha" : 


99 \ 






: GJy • 


100 | 






101 f 








102 | 


Thr 






103 


tys,Ar$ 


E 


mi** 


104 \ 


VaUw 






105 I 


•iG^'Asp"- 






106 1 


He 






107 I 








108 I 


.: : '\Arfl 11 









Thr 




; : -:;-;Pf* 


^Mg[*piS^ - Phe 






Giy * 




"<3ly : . : v;:i;: : 




Cly 






:::. : ^Trv 


Lys : 

* i iuw ; 


'•Thr 


wmm. 


"¥l! 


••: tau": 










" : "*e* 


He 


;* m 






m m 




Arg 





* The h60.3 light chain was made with the sequence shown in the 
Phase Mil column, with the following exceptions: 



L68: 


Arg \* 


instead of 


Gly 


L75: 


Tjr 


instead of 


Be 


L104: 


•Val 


instead of 


Leu 


L106: 


Glu 


instead of 


Be 
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FIGURE 3 

Oligonucleotide #1: 

lillllGAATTCAAGCTTTCCTGACTACATGAGTGCATTTCTGllllATTTCCAAT 
TTCAGATACCACCGGAGAAATrGTGCTAACACAA 

Oligonucleotide #2 

AATTTCTCCGGTGGTATCTGAAATTGGAAATAAAACAGAAATGCACTCATGTAG 
TCAGGAAAGCTTGAATTCAAAAAA 

Oligonucleotide #3 

TCTCCAGCTACATTGTCTTTGTCTCCAGGTGAGAGAGCCACTCTATCCTGCAGAG 
CCAGTGAAAGTGTTGATAGTTATGGCAATAGT 

Oligonucleotide #4 

ACTATCAACACTTTCACTGGCrCTGCAGGATAGAGTGGCTCTCTCACCTGGAGAC 
AAAGACAATGTAGCTGGAGATTGTGTTAGCAC 

Oligonucleotide #5 

TTTATGCACTGGTACCAGCAGAAACCAGGACAGGCACCAAGGCTCCTCATCTAT 
CGTGCATCCAACCTAGAAACTGGTATCCCTGCC 

Oligonucleotide #6 

AGTTTCTAGGTTGGATGCACGATAGATGAGGAGCCTrGGTGCCTGTCCTGGTTTC 
TGCTGGTACCAGTGCATAAAACTATTGCCATA 

Oligonucleotide #7 

AGGTTCAGTGGCAGTGGTTCTAGGACAGACTTCACTCTCACCTATTCTTCTCTAG 
AGCCTGAAGATTTTGCAGTGTATTACTGTCAG 
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Oligonucleotide #8 

CACTGCAAAATCTTCAGGCTCTAGAGAAGAATAGGTGAGAGTGAAGTCTGTCCT 
AGAACCACTGCCACTGAACCTGGCAGGGATACC 

Oligonucleotide #9 

CAAAGTAATGAGGATCCTCGGACGTTCGGTGGAGGCACCAAGGTGGAAGAGAA 
ACGTAAGTGCACTITCCTCGAGGTCGACTTTTTT 

Oligonucleotide #10 

AAAAAAGTCGACCTCGAGGAAAGTGCACTTACGTTTCTCTTCCACCrTGGTGCCT 
CCACCGAACGTCCGAGGATCCTCATTACTTTGCTGACAGTAATA 
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FIGURE 4 

Oligonucleotide # 1 

AAT^AAAGAATTCGAGCTCTTCTGATAACGCTGTCCTTCTGTTTGCAGGTGTCCAG 
TGTCAGGTCCAACTTGTCCAGTCCGG 



Oligonucleotide #2 

TTAACTTCGGCACCGGACTGGACAAGTTGGACCTGACACTGGACACCTGCAAAC 
AGAAGGACAGCGTTATCAGAAGAGCTCGAATTCTITTTT 



Oligonucleotide #3 

TGCCGAAGTTAAGAAGCCTGGCGCTTCTGTGAAGGTCTCCTGCAAGGCTTCTGG 
CTACACCTTCACCGACTACTGGATGAACTGGGTTCG 



Oligonucleotide #4 

CCAGGTGCCTGTCGAACCCAGTTCATCCAGTAGTCGGTGAAGGTGTAGCCAGAA 
GCCTrGCAGGAGACCTTCACAGAAGCGCCAGGCTTC 



Oligonucleotide #5 

ACAGGCACCTGGACAGGGCCTAGAGTGGATGGGAAGGAlTGATCCTrCCGATAG 
TGAAACTCACTACCATCAGAAGTTCCAGGGTAGGGT 



Oligonucleotide #6 

CGGGTCATTGTTACCCTACCCTGGAACTTCTGATTGTAGTGAGTTTCACTATCGG 
AAGGATCAATCCTTCCCATCCACTCTAGGCCCTGT 

Oligonucleotide #7 

AACAATGACCCGAGACACATCCACCAGCACAGTCTACATGGAACTCAGCAGCCT 
GCGATCTGAGGACACCGCAGTCTATTACTGTGCACG 



48 



EP0 578 515A2 



Oligonucleotide #8 

AGCCGTCCACCTCGTGCACAGTAATAGACTGCGGCGTCCTCAGATCGCAGGCTG 
CTGAGTTCCATGTAGACTGTGCTGGTGGATGTGTCT 

Oligonucleotide #9 

AGGTGGACGGCTCGGTTCCTTTGCTATGGACTACTGGGGTCAAGGCACCCTCGTC 
ACCGTCTCCTCAGGTGATGCCTCACACTCGAGGTCGACri'lTl'l' 

Oligonucleotide #10 

AAAAAAGTCGACCTCGAGTGTGAGGACTCACCTGAGGAGACGGTGACGAGGGT 
GCCTTGACCCCAGTAGTCCATAGCAAAGGAACCG 
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FACS Analysis 
Binding of FITC-m60.3 to HL60 cells 
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pGk.11 



1 GAATTC 

51 

101 

151 

201 

251 : 

351 

401 

451 • 

701 

751 

851 

901 

951 

1001 

1051 

1101 

1151 ? " 

1201 

1251 

1301 ' 

135.1 

14 or 

1451 

1501 

1551 • 

1601 : 

1651 

1701 GGCC CAGGGGACTG TGAGGACAGA AGGCTTGTGG GTTTGAGGGA 

1751 GGACTGTCTT GCAGAGGATG ATAGGGTAAA ATAGAATGAA GGATGATTTT 
1801 TATAAATGGT TACGTGCCTT AGGATGACTA CATATTTAGT CCCTTATAAG 
1851 AGAAATTGAG TAGTTGGTAA AACAACAGAT AATAATTATT AAATGAGGAA 
1901 AGAGAGAAAC CACAGGTGCA AAGATTCACT TTATTTATTC ATTCTCCTCC 
1951 AACATTAGCA TAATTAAAGC CAAGGAGGAG GAGGGGGGTG AGGTGAAAGA 
2001 TGAGCTGGAG GACCGCAATA GGGGTAGGTC CCCTGTGGAA AAAGGGTCAG 
2051 AGGCCAAAGG ATGGGAGGGG GTCAGGCTGG AACTGAGGAG CAGGTGGGGG 
2101 CACTTCTCCC TCTAACACTC TCCCCTGTTG AAGCTCTTTG TGACGGGCGA 
2151 GCTCAGGCCC TGATGGGTGA CTTCGCAGGC GTAGACTTTG TGTTTCTCGT 
2201 AGTCTGCTTT GCTCAGCGTC AGGGTGCTGC TGAGGCTGTA GGTGCTGTCC 
2251 TTGCTCTCCT GCTCTGTGAC ACTCTCCTGG GAGTTACCCG ATTGGAGGGC 
2301 GTTATCCACC TTCCACTGTA CTTTGGCCTC TCTGGGATAG AAGTTATTCA 
2351 GCAGGCACAC AACAGAGGCA GTTCCAGATT TCAACTGCTC ATCAGATGGC 
24 01 GGGAAGATGA AGACAGATGG TGCAGCCACA GTTCCTGAGG AAAGAAGCAA 
2451 ACAGGATGGT GTTTAAGTAA CAAAGTTCTG CCCTTGGGTG TGTTGTTTGC 
2501 GGATAAGGGC ATGTTAGGGA CAGACAGAAA ACAGCATGCT TATCCCAGAT 
2551 AATTATAGCA AGGAGACCAA GAAGCGTATT TAAAATCTTG ATGTTTTGAG 
2 601 TTTCTTCCTA GCTTCCCCCT ATTCCTTAAT AAAGTTCTAA ATTGTTTTGT 
2 651 TGGAGCTCTT TGCAGCCATT CTGAGGGCTT TGCATGCTTT TCTGACCTTG 
2701 CAGTAAACTC AATGCTTTAG GCAAAGAATG GCCACGTCAT CCGACCCCCT 
2751 CAGAGTTTAG AATTCATCGA TATCTAGATC CTAGATAATT GCATTCATTT 
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pGk:11 



2801 AAAAAAAAAA TATTTCTCCT AAAATGAATA CTCAGAAAGT GGTCTTGAAA 
2 851 AAGATTTGTG AAGCCGTTTT GACCAGAATG TCAAAGTCTT AATAGTAAGG 
2 901 CAAAACAAAC AACTAAAAAA GATCATGAAC AAAGTCACTG TAAAGACTTC 

2 951 GGGTATTGGA AAATAATTGA ATGGAGACCA ATAATCAGAG GGAAGAATAA 
3001 TAGAGTAATT TTAAGAAGTT TTCTAAATAT ATTAGAAATT AAAGACACTA 
3051 AAGTCCTTCA ATTTCTTACA TAACCTAATT TTGAAAATpA ATTCTAAATA 
3101 CATTTTAGAA GTCGATAAAC TTAAGTTTGG GGAAACTAGA ACTACTCAAG 
3151 CTAAAATTAA AAGGTTGAAC TCAATAAGTT AAAAGAGGAC CTCTCCAGTT 
3201 TCGGCTGAAT CCTCAACTTA TTTTAGAAAT GCAAATTACC CAGGTGGTGT 
3251 TTTGCTCAGC CTGGACTTTC GGTTTGGTGG GGCTGGACAG AGTGTTTCAA 
3301 AACCACTTCT TCAAACCACA GCTACAAGTT TACCTAGTGG TTTTATTTTC 
3351 CCTTCCCCAA ATAGCCTTGC CACATGACCT GCTTCCTGCC AGCTGCTGCA 
34 01 GGTGTTCTGG TTCTGATCGG CCATCTTGAC TCAACTCAAC ATTGCTCAAT 
34 51 TCATTTAAAA ATATTTTAAA CTTAATTTAT TATTGTTAAA AGTCAGTTCT 
3501 GGATAGGGTA TGAGAGAGCC TCACTCCCAT TCCTCGGTTA AACTTTAAGT 
3551 AATGTCAGTT CTACACAAAC AAGACCTCAA ATTGATTGAC AAAAATTTTG 
3601 GACATTTAAA AAAATGAGTA CTTGAAAACC CTCTCACATT TTAAAGTCAC 

3 651 AGTATTTAAC TATTTTTCCT AGGAACCAAC TTAAGAGTAA AAGCAACATC 
37 01 TTCTAATATT CCATACACAT ACTTCTGTGT TCCTTTGAAA GCTGGACTTT 
3751 TGCAGGCTCC ACCAGACCTC TCTAGGATCT CGAGCTCGCG AAAGCTTGCA 
3801 TGCCTGCAGG TCGACTCTAG AGGATCAAAC CAACTGTCTT TGAGTAGAGC 
36 51 CAAAATTGTT GATATACTTT GAATTTTAAT TATATTTCTT GCTGAGCAGA 
3901 GGTGGCAAGA GTTTTCACTA ATGTGCAAAA CCACCTCATG TTCCCCTCAC 

3 951 CTGGGAGCCA GAGTAGCAGG AGGAAGAGAA GCTGAGCTGG GGCTTCCATG 

4 001 GTTCCCTCTG GGTCCTAACT GAGCAGTTCC TCCCCAGGGC TCTGACACAG 
4 051 GCATTGATAT GGGCTCTGGA AGGTAGGGCA GCTGGGAGGG ACATGCAAAG 
4101 CAGCTGGGTG GGAGCTGAGC TTCCAGCTGC AGAGACCACC TGCTTCTTCC 
4151 TCTCTGCACT GAGCATCCTG CGCCACCCTG GTTGTCAGGC CAGAAAAGTC 
4 201 TGTTGGCTCA GTCTGAGTGT AGAACTTCTC CCTTGTGCTC AGAGAATTTC 
4251 ATTCCTATGT CTTTCTTCTC CTCAATCACC TAAATTCACC CAGATGATGT 
4301 TTGGCACAAG CCTGTTAAGA ACAATATAAA AGGCTGTGTT TTCATTTCTC 
4351 TCTTCCTATC CTCAATATGC CCAGTCATCT CCCTAAGTGC ATTATTGGAT 
4 4 01 CCAGACATGA TAAGATACAT TGATGAGTTT GGACAAACCA CAACTAGAAT 
4 451 GCAGTGAAAA AAATGCTTTA TTTGTGAAAT TTGTGATGCT ATTGCTTTAT 
4 501 TTGTAACCAT TATAAGCTGC AATAAACAAG TTAACAACAA CAATTGCATT 
4 551 CATTTTATGT TTCAGGTTCA GGGGGAGGTG TGGGAGGTTT TTTAAAGCAA 
4 601 GTAAAACCTC TACAAATGTG GTATGGCTGA TTATGATCTC TAGTCAAGGC 
4 651 ACTATACATC AAATATTCCT TATTAACCCC TTTACAAATT AAAAAGCTAA 
4701 AGGTACACAA TTTTTGAGCA TAGTTATTAA TAGCAGACAC TCTATGCCTG 
4751 TGTGGAGTAA GAAAAAACAG TATGTTATGA TTATAACTGT TATGCCTACT 
4 801 TATAAAGGTT ACAGAATATT TTTCCATAAT TTTCTTGTAT AGCAGTGCAG 
4 851 CTTTTTCCTT TGTGGTGTAA ATAGCAAAGC AAGCAAGAGT TCTATTACTA 
4 901 AACACAGCAT GACTCAAAAA ACTTAGCAAT TCTGAAGGAA AGTCCTTGGG 
4 951 GTCTTCTACC TTTCTCTTCT TTTTTGGAGG AGTAGAATGT TGAGAGTCAG 
5001 CAGTAGCCTC ATCATCACTA GATGGCATTT CTTCTGAGCA AAACAGGTTT 
5051 TCCTCATTAA AGGCATTCCA CCACTGCTCC CATTCATCAG TTCCATAGGT 
5101 TGGAATCTAA AATACACAAA CAATTAGAAT CAGTAGTTTA ACACATTATA 
5151 CACTTAAAAA TTTTATATTT ACCTTAGAGC TTTAAATCTC TGTAGGTAGT 
5201 TTGTCCAATT ATGTCACACC ACAGAAGTAA GGTTCCTTCA CAAAGATCCG 
5251 GGGCCCACTC ATAAATCCAG TTGCCGCCAC GGTAGCCAAT CACCGTATCG 
5301 TATAAATCAT CGTCGGTACG TTCGGCATCG CTCATCACAA TACGTGCCTG 
5351 GACGTCGAGG ATTTCGCGTG GGTCAATGCC GCGCCAGATC C AC AT C AG AC 
54 01 GGTTAATCAT GCGATACCAG TGAGGGATGG TTTTACCATC AAGGGCCGAC 
5451 TGCACAGGCG GTTGTGCGCC GTGATTAAAG CGGCGGACTA GCGTCGAGGT 
5501 TTCAGGATGT TTAAAGCGGG GTTTGAACAG GGTTTCGCTC AGGTTTGCCT 
5551 GTGTCATGGA TGCAGCCTCC AGAATACTTA CTGGAAACTA TTGTAACCCG 
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pGk.11 

2801 AAAAAAAAAA TATTTCTCCT AAAATGAATA CTCAGAAAGT GGTCTTGAAA 
2851 AAGATTTGTG AAGCCGTTTT GACCAGAATG TCAAAGTCTT AATAGTAAGG 
2 901 CAAAACAAAC AACTAAAAAA GATCATGAAC AAAGTCACTG TAAAGACTTC 

2 951 GGGTATTGGA AAATAATTGA ATGGAGACCA ATAATCAGAG GGAAGAATAA 
3001 TAGAGTAATT TTAAGAAGTT TTCTAAATAT ATTAGAAATT AAAGACACTA 
3051 AAGTCCTTCA ATTTCTTACA TAACCTAATT TTGAAAATpA ATTCTAAATA 
3101 CATTTTAGAA GTCGATAAAC TTAAGTTTGG GGAAACTAGA ACTACTCAAG 
3151 CTAAAATTAA AAGGTTGAAC TCAATAAGTT AAAAGAGGAC CTCTCCAGTT 
3201 TCGGCTGAAT CCTCAACTTA TTTTAGAAAT GCAAATTACC CAGGTGGTGT 
3251 TTTGCTCAGC CTGGACTTTC GGTTTGGTGG GGCTGGACAG AGTGTTTCAA 
3301 AACCACTTCT TCAAACCACA GCTACAAGTT TACCTAGTGG TTTTATTTTC 
3351 CCTTCCCCAA ATAGCCTTGC CACATGACCT GCTTCCTGCC AGCTGCTGCA 
34 01 GGTGTTCTGG TTCTGATCGG CCATCTTGAC TCAACTCAAC ATTGCTCAAT 
34 51 TCATTTAAAA ATATTTTAAA CTTAATTTAT TATTGTTAAA AGTCAGTTCT 
3501 GGATAGGGTA TGAGAGAGCC TCACTCCCAT TCCTCGGTTA AACTTTAAGT 
3551 AATGTCAGTT CTACACAAAC AAGACCTCAA ATTGATTGAC AAAAATTTTG 

3 601 GACATTTAAA AAAATGAGTA CTTGAAAACC CTCTCACATT TTAAAGTCAC 
3 651 AG TATTTAAC TATTTTTCCT AGGAACCAAC TTAAGAGTAA AAGCAACATC 
37 01 TTCTAATATT CCATACACAT ACTTCTGTGT TCCTTTGAAA GCTGGACTTT 
3751 TGCAGGCTCC ACCAGACCTC TCTAGGATCT CGAGCTCGCG AAAGCTTGCA 
3801 TGCCTGCAGG TCGACTCTAG AGGATCAAAC CAACTGTCTT TGAGTAGAGC 
3 851 CAAAATTGTT GATATACTTT GAATTTTAAT TATATTTCTT GCTGAGCAGA 
3 901 GGTGGCAAGA GTTTTCACTA ATGTGCAAAA CCACCTCATG TTCCCCTCAC 

3 951 CTGGGAGCCA • GAGTAGCAGG AGGAAGAGAA GCTGAGCTGG GGCTTCCATG 

4 001 GTTCCCTCTCS GGTCCTAACT GAGCAGTTCC TCCCCAGGGC TCTGACACAG 
4 051 GCATTGATAT GGGCTCTGGA AGGTAGGGCA GCTGGGAGGG ACATGCAAAG 
4101 CAGCTGGGTG GGAGCTGAGC TTCCAGCTGC AGAGACCACC TGCTTCTTCC 
4151 TCTCTGCACT GAGCATCCTG CGCCACCCTG GTTGTCAGGC CAGAAAAGTC 
4 201 TGTTGGCTCA GTCTGAGTGT AGAACTTCTC CCTTGTGCTC AGAGAATTTC 
4251 ATTCCTATGT CTTTCTTCTC CTCAATCACC TAAATTCACC CAGATGATGT 
4 301 TTGGCACAAG CCTGTTAAGA ACAATATAAA AGGCTGTGTT TTCATTTCTC 
4 351 TCTTCCTATC CTCAATATGC CCAGTCATCT CCCTAAGTGC ATTATTGGAT 
4 4 01 CCAGACATGA TAAGATACAT TGATGAGTTT GGACAAACCA CAACTAGAAT 
4 4 51 GCAGTGAAAA AAATGCTTTA TTTGTGAAAT TTGTGATGCT ATTGCTTTAT 
4 501 TTGTAACCAT TATAAGCTGC AATAAACAAG TTAACAACAA CAATTGCATT 
4551 CATTTTATGT TTCAGGTTCA GGGGGAGGTG TGGGAGGTTT TTTAAAGCAA 
4 601 GTAAAACCTC TACAAATGTG GTATGGCTGA TTATGATCTC TAGTCAAGGC 
4 651 AC TAT AC AT C AAATATTCCT TATTAACCCC TTTACAAATT AAAAAGCTAA 
4701 AGGTACACAA TTTTTGAGCA TAGTTATTAA TAGCAGACAC TCTATGCCTG 
4751 TGTGGAGTAA GAAAAAACAG TATGTTATGA TTATAACTGT TATGCCTACT 
4 801 TATAAAGGTT ACAGAATATT TTTCCATAAT TTTCTTGTAT AGCAGTGCAG 
4 851 CTTTTTCCTT TGTGGTGTAA ATAGCAAAGC AAGCAAGAGT TCTATTACTA 
4 901 AACACAGCAT GACTCAAAAA ACTTAGCAAT TCTGAAGGAA AGTCCTTGGG 
4 951 GTCTTCTACC TTTCTCTTCT TTTTTGGAGG AGTAGAATGT TGAGAGTCAG 
5001 CAGTAGCCTC ATCATCACTA GATGGCATTT CTTCTGAGCA AAACAGGTTT 
5051 TCCTCATTAA AGGCATTCCA CCACTGCTCC CATTCATCAG TTCCATAGGT 
5101 TGGAATCTAA AATACACAAA CAATTAGAAT CAGTAGTTTA AC AC ATT AT A 
5151 CACTTAAAAA TTTTATATTT ACCTTAGAGC TTTAAATCTC TGTAGGTAGT 
5201 TTGTCCAATT ATGTCACACC ACAGAAGTAA GGTTCCTTCA CAAAGATCCG 
5251 GGGCCCACTC ATAAATCCAG TTGCCGCCAC GGTAGCCAAT CACCGTATCG 
5301 TATAAATCAT CGTCGGTACG TTCGGCATCG CTCATCACAA TACGTGCCTG 
5351 GACGTCGAGG ATTTCGCGTG GGTCAATGCC GCGCCAGATC CACATCAGAC 
54 01 GGTTAATCAT GCGATACCAG TGAGGGATGG TTTTACCATC AAGGGCCGAC 
5451 TGCACAGGCG GTTGTGCGCC GTGATTAAAG CGGCGGACTA GCGTCGAGGT 
5501 TTCAGGATGT TTAAAGCGGG GTTTGAACAG GGTTTCGCTC AGGTTTGCCT 
5551 GTGTCATGGA TGCAGCCTCC AGAATACTTA CTGGAAACTA TTGTAACCCG 
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5 601 CCTGAAGTTA AAAAGAACAA CGCCCGGCAG TGCCAGGCGT TGAAAAGATT 
5 651 AGCGACCGGA GATTGGCGGG ACGAATACGA CGCCCATATC CCACGGCTGT 
5701 TCAATCCAGG TATCTTGCGG GATATCAACA ACATAGTCAT CAACCAGCGG 
5751 ACGACCAGCC GGTTTTGCGA AGATGGTGAC AAAGTGCGCT TTTGGATACA 
5801 TTTCACGAAT CGCAACCGCA GTACCACCGG TATCCACCAG GTCATCAATA 

5 851 ACGATGAAGC CTTCGCCATC GCCTTCTGCG CGTTTCAGCA CTTTAAGCTC 
5901 GCGCTGGTTG TCGTGATCGT AGCTGGAAAT ACAAACGGTA TCGACATGAC 
5951 GAATACCCAG TTCACGCGCC AGTAACGCAC CCGGTACCAG ACCGCCACGG 
6001 CTTACGGCAA XAATGCCTTT CCATTGTTCA GAAGGCATCA GTCGGCTTGC 
€051 GAGTTTACGT GCATGGATCT GCAACATGTC CCAGGTGACG ATGTATTTTT 
6101 CGCTCATGTG AAGTGTCCCA GCCTGTTTAT CTACGGCTTA AAAAGTGTTC 
6151 GAGGGGAAAA TAGGTTGCGC GAGATTATAG AGATCAGCTT TTTGCAAAAG 
6201 CCTAGGCCTC CAAAAAAGCC TCCTCACTAC 3TCTGGAATA GCTCAGAGGC 
6251 CGAGGCGGCC TCGGCCTCTG CATAAATAAA AAAAATTAGT CAGCCATGGG 
6301 GCGGAGAATG GGGCGGGATG GGCGGAGTTA GGGCGGAACT GGGCGGAGTT 
6351 AGGGGCGGGA CTATGGTTGC TGACTAATTG AGATGCTGCA TTAATGAATC 
64 01 GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC 
64 51 CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT 
6501 CAGCTCACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC. 
6551 GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA GGAACCGTAA 

6 601 AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 
6651 AT C AC AAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA 
6701 TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT 
6751 TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA 
6801 GCGTGGCGCT TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG 
6851 GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG TTCAGCCCGA 
6901 CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 
6951 ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG 

7 001 AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG 
7 051 CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA 
7101 CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT 
7151 GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC GCAGAAAAAA 
7201 AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 
7251 GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG 
7301 ATCTTCACCT AGATCCTTTT AAATT AAAAA TGAAGTTTTA AATCAATCTA 
7351 AAGTATATAT GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG 
7401 AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA 
7451 CTCCCCGTCG TGTAGATAAC TACGATACGG GAGGGCTTAC CATCTGGCCC 
7 501 CAGTGCTGCA ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT 
7 551 CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA 
7 601 ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT 
7 651 AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG 
7701 GCATCGTGGT GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT 
7751 TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC 
7 801 GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 
7 851 TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG 
7 901 CCATCCGTAA GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT 
7 951 CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC 
8001 GGGATAATAC CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA 
8051 AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC 
8101 CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 
8151 CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA 
8201 AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT 
8251 TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT 
8301 ACATATTTGA ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA 
8351 TTTCCCCGAA AAGTGCCACC TGACGTCTAA GAAACCATTA TTATCATGAC 
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1 AAGCTTTCTG GGGCAGGCCA GGCCTGACCT TGGCTTTGGG GCAGGGAGGG 
51 GGCTAAGGTG AGGCAGGTGG CGCCAGCCAG GTGCACACCC AATGCCCATG 
101 AGCCCAGACA CTGGACGCTG AACCTCGCGG ACAGTTAAGA ACCCAGGGGC 
151 CTCTGCGCCC TGGGCCCAGC TCTGTCCCAC ACCGCGGTCA CATGGCACCA 
201 CCTCTCTTGC AGCCTCCACC AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 
251 TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT GCCTGGTCAA 
301 GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA GGCGCCCTGA 
351 CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
401 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC 
451 CTACATCTGC AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAAC 
501 GCGTTGGTGA GAGGCCAGCA CAGGGAGGGA GGGTGTCTGC TGGAAGCCAG 
551 GCTCAGCGCT CCTGCCTGGA CGCATCCCGG CTATGCAGCC CCAGTCCAGG 
601 GCAGCAAGGC AGGCCCCGTC TGCCTCTTCA CCCGGAGGCC TCTGCCCGCC 
651 CCACTCATGC TCAGGGAGAG GGTCTTCTGG CTTTTTCCCC AGGCTCTGGG 
701 CAGGCACAGG CTAGGTGCCC CTAACCCAGG CCCTGCACAC AAAGGGGCAG 
751 GTGCTGGGCT CAGACCTGCC AAGAGCCATA TCCGGGAGGA CCCTGCCCCT 
801 GACCTAAGCC CACCCCAAAG GCCAAACTCT CCACTCCCTC AGCTCGGACA 
851 CCTTCTCTCC TCCCAGATTC CAGTAACTCC CAATCTTCTC TCTGCAGAGC 
901 CCAAATCTTG TGACAAAACT CACACATGCC CACCGTGCCC AGGTAAGCCA 
951 GCCCAGGCCT CGCCCTCCAG CTCAAGGCGG GACAGGTGCC CTAGAGTAGC 
1001 CTGCATCCAG GGACAGGCCC CAGCCGGGTG CTGACACGTC CACCTCCATC 
1051 TCTTCCTCAG CACCTGAACT CCTGGGGGGA CCGTCAGTCT TCCTCTTCCC 
1101 CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCGT GAGGTCACAT 
1151 GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 
1201 TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA 
1251 GCAGTACAAC AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC 
1301 AGGACTGGCT GAATGGCAAG GAGTACAAGT GCAAGGTCTC CAACAAAGCC 
1351 CTCCCAGCCC CCATCGAGAA AACCATCTCC AAAGCCAAAG GTGGGACCCG 
14 01 TGGGGTGCGA GGGCCACATG GACAGAGGCC GGCTCGGCCC ACCCTCTGCC 
1451 CTGAGAGTGA CCGCTGTACC AACCTCTGTC CCTACAGGGC AGCCCCGAGA 
1501 ACCACAGGTG TACACCCTGC CCCCATCTAG AGAGGAGATG ACCAAGAACC 
1551 AGGTCAGCCT GACCTGCCTG GTCAAAGGCT TCTATCCCAG CGACATCGCC 
1601 GTGGAGTGGG AGAGCAATGG GCAGCCGGAG AACAACTACA AGACCACGCC 
1651 TCCCGTGCTG GACTCCGACG GCTCCTTCTT CCTCTACAGC AAGCTCACCG 
1701 TGGACAAGAG CAGGTGGCAG CAGGGGAACG TCTTCTCATG CTCCGTGATG 
1751 CATGAGGCTC TGCACAACCA CTACACGCAG AAGAGCCTCT CCCTGTCTCC 
1801 GGGTAAATGA GTGCGACGGC CGGCAAGCCC CCGCTCCCCG GGCTCTCGCG 
1851 GTCGCACGAG GATGCTTGGC ACGTACCCCC TGTACATACT TCCCGGGCGC 
1901 CCAGCATGGA AATAAAGCAC CCAGCGCTGC CCTGGGCCCC TGCGAGACTG 
1951 TGATGGTTCT TTCCACGGGT CAGGCCGAGT CTGAGGCCTG AGTGGCATGA 
2001 GGGAGGCAGA GCGGGTCCCA CTGTCCCCAC ACTGGCCCAG GCTGTGCAGG 
2051 TGTGCCTGGG CCCCCTAGGG TGGGGCTCAG CCAGGGGCTG CCCTCGGCAG 
2101 GGTGGGGG AT TTGCCAGCGT GGCCCTCCCT CCAGCAGCAC CTGCCCTGGG 
2151 CTGGGCCACG GGAAGCCCTA GGAGCCCCTG GGGACAGACA CACAGCCCCT 
2201 GCCTCTGTAG GAGACTGTCC TGTTCTGTGA GCGCCCCTGT CCTCCCGACC 
2251 TCCATGCCCA CTCGGGGGCA TGCCTAGTCC ATGTGCGTAG GGACAGGCCC 
2301 TCCCTCACCC ATCTACCCCC ACGGCACTAA CCCCTGGCTG CCCTGCCCAG 
2351 CCTCGCACCC GCATGGGGAC ACAACCGACT CCGGGGACAT GCACTCTCGG 
24 01 GCCCTGTGGA GGGACTGGTG CAGATGCCCA CACACACACT CAGCCCAGAC 
2451 CCGTTCAACA AACCCCGCAC TGAGGTTGGC CGGCCACACG GCCACCACAC 
2501 ACACACGTGC ACGCCTCACA CACGGAGCCT CACCCGGGCG AACTGCACAG 
2551 CACCCAGACC AGAGCAAGGT CCTCGCACAC GTGAACACTC CTCGGACACA 
2 601 GGCCCCCACG AGCCCCACGC GGCACCTCAA GGCCCACGAG CCTCTCGGCA 
2 651 GCTTCTCCAC ATGCTGACCT GCTCAGACAA ACCCAGCCCT CCTCTCACAA 
2701 GGGTGCCCCT GCAGCCGCCA CACACACACA GGGGATCACA CACCACGTCA 
2751 CGTCCCTGGC CCTGGCCCAC TTCCCAGTGC CGCCCTTCCC TGCAGGACGG 
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2 801 ATCCAGACAT GATAAGATAC ATTGATGAGT TTGGACAAAC CACAACTAGA 
2 851 ATGCAGTGAA AAAAATGCTT TATTTGTGAA ATTTGTGATG CTATTGCTTT 
2 901 ATTTGTAACC ATTATAAGCT GCAATAAACA AGTTAACAAC AACAATTGCA 
2 951 TTCATTTTAT GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT TTTTTAAAGC 
3001 AAGTAAAACC TCTACAAATG TGGTATGGCT GATTATGATC TCTAGTCAAG 
3051 GCACTATACA TCAAATATTC CTTATTAACC CCTTTACAAA TTAAAAAGCT 
3101 AAAGGTACAC AATTTTTGAG CATAGTTATT AATAGCAGAC ACTCTATGCC 
3151 TGTGTGGAGT AAGAAAAAAC AGTATGTTAT GATTATAACT GTTATGCCTA 
3201 CTTATAAAGG TTACAGAATA TTTTTCCATA ATTTTCTTGT ATAGCAGTGC 
3251 AGCTTTTTCC TTTGTGGTGT AAATAGCAAA GCAAGCAAGA GTTCTATTAC 
3301 TAAACACAGC ATGACTCAAA AAACTTAGCA ATTCTGAAGG AAAGTCCTTG 
3351 GGGTCTTCTA CCTTTCTCTT CTTTTTTGGA GGAGTAGAAT GTTGAGAGTC 
3401 AGCAGTAGCC TCATCATCAC TAGATGGCAT TTCTTCTGAG CAAAACAGGT 
3451 TTTCCTCATT AAAGGCATTC CACCACTGCT CCCATTCATC AGTTCCATAG 
3501 GTTGGAATCT AAAATACACA AACAATTAGA ATCAGTAGTT TAACACATTA 
3551 TACACTTAAA AATTTTATAT TTACCTTAGA GCTTTAAATC TCTGTAGGTA 
3601 GTTTGTCCAA TTATGTCACA CCACAGAAGT AAGGTTCCTT CACAAAGATC 
3651 CGGGACCAAA GCGGCCATCG TGCCTCCCCA CTCCTGCAGT TCGGGGGCAT 
3701 GGATGCGCGG ATAGCCGCTG CTGGTTTCCT GGATGCCGAC GGATTTGCAC 
3751 TGCCGGTAGA AC TCCGCGAG GTCGTCCAGC CTCAGGCAGC AGCTGAACCA' 
3801 ACTCGCGAGG GGATCGAGCC CGGGGTGGGC GAAGAACTCC AGCATGAGAT 
3851 CCCCGCGCTG GAGGATCATC CAGCCGGCGT CCCGGAAAAC GATTCCGAAG 
3901 CCCAACCTTT CATAGAAGGC GGCGGTGGAA TCGAAATCTC GTGATGGCAG 
3951 GTTGGGCGTC GCTTGGTCGG TCATTTCGAA CCCCAGAGTC CCGCTCAGAA 
4 001 GAACTCGTCA AGAAGGCGAT AGAAGGCGAT GCGCTGCGAA TCGGGAGCGG 
4 051 CGATACCGTA AAGCACGAGG AAGCGGTCAG CCCATTCGCC GCCAAGCTCT 
4101 TCAGCAATAT CACGGGTAGC CAACGCTATG TCCTGATAGC GGTCCGCCAC 
4151 ACCCAGCCGG CCACAGTCGA TGAATCCAGA AAAGCGGCCA TTTTCCACCA 
4201 TGATATTCGG CAAGCAGGCA TCGCCATGGG TCACGACGAG ATCCTCGCCG 
4251 TCGGGCATGC GCGCCTTGAG CCTGGCGAAC AGTTCGGCTG GCGCGAGCCC 
4301 CTGATGCTCT TCGTCCAGAT CATCCTGATC GACAAGACCG GCTTCCATCC 
4351 GAGTACGTGC TCGCTCGATG CGATGTTTCG CTTGGTGGTC GAATGGGCAG 
44 01 GTAGCCGGAT CAAGCGTATG CAGCCGCCGC ATTGCATCAG CCATGATGGA 
4 451 TACTTTCTCG GCAGGAGCAA GGTGAGATGA CAGGAGATCC TGCCCCGGCA 
4 501 CTTCGCCCAA TAGCAGCCAG TCCCTTCCCG CTTCAGTGAC AACGTCGAGC 
4 551 ACAGCTGCGC AAGGAACGCC CGTCGTGGCC AGCCACGATA GCCGCGCTGC 
4 601 CTCGTCCTGC AGTTCATTCA GGGCACCGGA CAGGTCGGTC TTGACAAAAA 
4 651 GAACCGGGCG CCCCTGCGCT GACAGCCGGA ACACGGCGGC ATCAGAGCAG 
4701 CCGATTGTCT GTTGTGCCCA GTCATAGCCG AATAGCCTCT CCACCCAAGC 
4751 GGCCGGAGAA CCTGCGTGCA ATCCATCTTG TTCAATCATG CGAAACGATC 
4 801 CTCATCCTGT CTCTTGATCA GATCTTGATC CCCTGCGCCA TCAGATCCTT 
4 851 GGCGGCAAGA AAGCCATCCA GTTTACTTTG CAGGGCTTCC CAACCTTACC 
4 901 AGAGGGCGCC CCAGCTGGCA ATTCCGGTTC GCTTGCTGTC CATAAAACCG 
4 951 CCCAGTCTAG CTATCGCCAT GTAAGCCCAC TGCAAGCTAC CTGCTTTCTC 
5001 TTTGCGCTTG CGTTTTCCCT TGTCCAGATA GCCCAGTAGC TGACATTCAT 
5051 CCGGGGTCAG CACCGTTTCT GCGGACTGGC TTTCTACGTG TTCCGCTTCC 
5101 TTTAGCAGCC CTTGCGCCCT GAGTGCTTGC GGCAGCGTGA AGCTAGCTTT 
5151 TTGCAAAAGC CTAGGCCTCC AAAAAAGCCT CCTCACTACT TCTGGAATAG 
5201 CTCAGAGGCC GAGGCGGCCT CGGCCTCTGC ATAAATAAAA AAAATTAGTC 
5251 AGCCATGGGG CGGAGAATGG GGCGGGATGG GCGGAGTTAG GGCGGAACTG 
5301 GGCGGAGTTA GGGGCGGGAC TATGGTTGCT GACTAATTGA GATGCATGCT 
5351 TTGCATACTT CTGCCTGCTG GGGAGCCTGG GGACTTTCCA CACCTGGTTG 
54 01 CTGACTAATT GAGATGCATG CTTTGCATAC TTCTGCCTGC TGGGGAGCCT 
5451 GGGGACTTTC CACACCCTAA CTGACACACA TTCCACAGCT GCCTCGCGCG 
5501 TTTCGGTGAT GACGGTGAAA ACCTCTGACA CATGCAGCTC CCGGAGACGG 
5551 TCACAGCTTG TCTGTAAGCG GATGCCGGGA GCAGACAAGC CCGTCAGGGC 
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5601 GCGTCAGCGG GTGTTGGCGG GTGTCGGGGC GCAGCCATGA CCCAGTCACG 
5651 TAGCGATAGC GGAGTGTATA CTGGCTTAAC TATGCGGCAT CAGAGCAGAT 
5701 TGTACTGAGA GTGCACCATA TGCGGTGTGA AATACCGCAC AGATGCGTAA 
5751 GGAGAAAATA CCGCATCAGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG 
5801 CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC 
5851 GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT 
5901 GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 
5951 GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG 
6001 CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 
6051 TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT 
6101 ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA 
6151 TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC 
6201 TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC 
6251 GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT 
6301 GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 
6351 CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 
64 01 GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT 
6451 TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT 
6501 TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT 
6551 CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG 
6601 TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 
6651 TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA 
6701 ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC 
6751 GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA 
6801 TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA 
6851 CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC 
6901 AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 
6951 TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 
7001 AATAGTTTGC GCAACGTTGT TGCCATTGCT GCAGGCATCG TGGTGTCACG 
7051 CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC 
7101 GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT 
7151 CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT 
7201 TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 
7251 TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG 
7301 CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ACACGGGATA ATACCGCGCC 
7351 ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC 
7401 GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC 
7451 ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC 
7501 TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 
7551 CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA 
7 601 AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT 
7 651 TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC 
7701 CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT 
7751 AGGCGTATCA CGAGGCCCTT TCGTCTTCAA GAATTCATCG ATATCGGAAA 
7801 ATGAAAAAAA ATATTTTTTA ATTTTAAAAT GAAATGTTTA TTTTCAATTT 
7851 CTCCAAATTT CACAAGGAAA GATTAGTCAC GGGTATGGGA GAGCAGAGGA 
7 901 CCATAAGAGT TCAGGAATAG AATCCATTAT GATTCTGGAG TCAAGGAAGT 
7 951 ACTGATGCCA AGGTTTCAGT ATAAGAGCAG TATCCACTGG AAAGGATAAA 
8001 GTCACTACAA CTGAGCACAG AGCAGGACAG CTACCTAATG AGTGGTCACT 
8051 AATGGGCCAC TGTTACACTG TTATACGGCT TAGGAATGAG CACTGAGGCT 
8101 GTGAGGTGTA TGGGTAAGGA CATCAGGATG TAAACCCAGC TCAGGTAGAG 
8151 GACTCAGAGC ACAGCACAAT CAGCACGAAC TAATAAACAA CAGATAAGAT 
8201 AAGGCACAAG CTCAGCAATA TTGGATCAGG GATCTTTGTA AATCTGACTG 
8251 TGTATTCAGT CTAGTTCAAT GTGACTCATG AAGCCCACCC ATATGCAAAT 
8301 CTAGAGAAGA CTTTAGAGTA TAAATCTGAG GCTCACCTCA CATACCAGCA 
8351 AGGGAGTGAC CAGCTTGTCT TAAGGCACCA CTGAGCCCAA GTCTTAGACA 
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8401 TCATGGATTG GCTGTGGAAC TTGCTATTCC TGATGGCAGC TGCCCAAGGT 
8451 AAGTCATCAG AAAAAAGAGT TCCAAGGGAA ATTGAAGCAG TTCCGAGCTC 
8501 GGTACCCTCG AGATCCTAGA GAGGTCTGGT GGAGCCTGCA AAAGTCCAGC 
8551 TTTCAAAGGA ACACAGAAGT ATGTGTATGG AATATTAGAA GATGTTGCTT 
8601 TTACTCTTAA GTTGGTTCCT AGGAAAAATA GTTAAATACT GTGACTTTAA 
8651 AATGTGAGAG GGTTTTCAAG TACTCATTTT TTTAAATGTC CAAAATTTTT 
8701 GTCAATCAAT TTGAGGTCTT GTTTGTGTAG AACTGACATT ACTTAAAGTT 
8751 TAACCGAGGA ATGGGAGTGA GGCTCTCTCA TACCCTATCC AGAACTGACT 
8801 TTTAACAATA ATAAATTAAG TTTAAAATAT TTTTAAATGA ATTGAGCAAT 
8851 GTTGAGTTGA GTCAAGATGG CCGATCAGAA CCAGAACACC TGCAGCAGCT 
8901 GGCAGGAAGC AGGTCATGTG GCAAGGCTAT TTGGGGAAGG GAAAATAAAA 
8951 CCACTAGGTA AACTTGTAGC TGTGGTTTGA AGAAGTGGTT TTGAAACACT 
9001 CTGTCCAGCC CCACCAAACC GAAAGTCCAG GCTGAGCAAA ACACCACCTG 
9051 GGTAATTTGC ATTTCTAAAA TAAGTTGAGG ATTCAGCCGA AACTGGAGAG 
9101 GTCCTCTTTT AACTTATTGA GTTCAACCTT TTAATTTTAG CTTGAGTAGT 
9151 TCTAGTTTCC CCAAACTTAA GTTTATCGAC TTCTAAAATG TATTTAGAAT 
9201 T 
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